A combinatorial optimization approach for diverse motif finding applications
<p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2006-08-01
|
Series: | Algorithms for Molecular Biology |
Online Access: | http://www.almob.org/content/1/1/13 |
id |
doaj-ab402160149d4399ab4ead6b6d98e376 |
---|---|
record_format |
Article |
spelling |
doaj-ab402160149d4399ab4ead6b6d98e3762020-11-24T23:01:48ZengBMCAlgorithms for Molecular Biology1748-71882006-08-01111310.1186/1748-7188-1-13A combinatorial optimization approach for diverse motif finding applicationsSingh MonaZaslavsky Elena<p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied.</p> <p>Results</p> <p>We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem.</p> <p>Conclusion</p> <p>Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications.</p> http://www.almob.org/content/1/1/13 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Singh Mona Zaslavsky Elena |
spellingShingle |
Singh Mona Zaslavsky Elena A combinatorial optimization approach for diverse motif finding applications Algorithms for Molecular Biology |
author_facet |
Singh Mona Zaslavsky Elena |
author_sort |
Singh Mona |
title |
A combinatorial optimization approach for diverse motif finding applications |
title_short |
A combinatorial optimization approach for diverse motif finding applications |
title_full |
A combinatorial optimization approach for diverse motif finding applications |
title_fullStr |
A combinatorial optimization approach for diverse motif finding applications |
title_full_unstemmed |
A combinatorial optimization approach for diverse motif finding applications |
title_sort |
combinatorial optimization approach for diverse motif finding applications |
publisher |
BMC |
series |
Algorithms for Molecular Biology |
issn |
1748-7188 |
publishDate |
2006-08-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied.</p> <p>Results</p> <p>We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem.</p> <p>Conclusion</p> <p>Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications.</p> |
url |
http://www.almob.org/content/1/1/13 |
work_keys_str_mv |
AT singhmona acombinatorialoptimizationapproachfordiversemotiffindingapplications AT zaslavskyelena acombinatorialoptimizationapproachfordiversemotiffindingapplications AT singhmona combinatorialoptimizationapproachfordiversemotiffindingapplications AT zaslavskyelena combinatorialoptimizationapproachfordiversemotiffindingapplications |
_version_ |
1725638738829115392 |