Finding regulatory elements and regulatory motifs: a general probabilistic framework

<p>Abstract</p> <p>Over the last two decades a large number of algorithms has been developed for regulatory motif finding. Here we show how many of these algorithms, especially those that model binding specificities of regulatory factors with position specific weight matrices (WMs)...

Full description

Bibliographic Details
Main Author:	van Nimwegen Erik
Format:	Article
Language:	English
Published:	BMC 2007-09-01
Series:	BMC Bioinformatics
Online Access:	http://www.biomedcentral.com/1471-2105/8/S6/S4

id	doaj-e35f36d50f604ac5adc19cdcb8cd728f
record_format	Article
spelling	doaj-e35f36d50f604ac5adc19cdcb8cd728f2020-11-24T21:33:42ZengBMCBMC Bioinformatics1471-21052007-09-018Suppl 6S410.1186/1471-2105-8-S6-S4Finding regulatory elements and regulatory motifs: a general probabilistic frameworkvan Nimwegen Erik<p>Abstract</p> <p>Over the last two decades a large number of algorithms has been developed for regulatory motif finding. Here we show how many of these algorithms, especially those that model binding specificities of regulatory factors with position specific weight matrices (WMs), naturally arise within a general Bayesian probabilistic framework. We discuss how WMs are constructed from sets of regulatory sites, how sites for a given WM can be discovered by scanning of large sequences, how to cluster WMs, and more generally how to cluster large sets of sites from different WMs into clusters. We discuss how 'regulatory modules', clusters of sites for subsets of WMs, can be found in large intergenic sequences, and we discuss different methods for <it>ab initio </it>motif finding, including expectation maximization (EM) algorithms, and motif sampling algorithms. Finally, we extensively discuss how module finding methods and <it>ab initio </it>motif finding methods can be extended to take phylogenetic relations between the input sequences into account, i.e. we show how motif finding and phylogenetic footprinting can be integrated in a rigorous probabilistic framework. The article is intended for readers with a solid background in applied mathematics, and preferably with some knowledge of general Bayesian probabilistic methods. The main purpose of the article is to elucidate that all these methods are not a disconnected set of individual algorithmic recipes, but that they are just different facets of a single integrated probabilistic theory.</p> http://www.biomedcentral.com/1471-2105/8/S6/S4
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	van Nimwegen Erik
spellingShingle	van Nimwegen Erik Finding regulatory elements and regulatory motifs: a general probabilistic framework BMC Bioinformatics
author_facet	van Nimwegen Erik
author_sort	van Nimwegen Erik
title	Finding regulatory elements and regulatory motifs: a general probabilistic framework
title_short	Finding regulatory elements and regulatory motifs: a general probabilistic framework
title_full	Finding regulatory elements and regulatory motifs: a general probabilistic framework
title_fullStr	Finding regulatory elements and regulatory motifs: a general probabilistic framework
title_full_unstemmed	Finding regulatory elements and regulatory motifs: a general probabilistic framework
title_sort	finding regulatory elements and regulatory motifs: a general probabilistic framework
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2007-09-01
description	<p>Abstract</p> <p>Over the last two decades a large number of algorithms has been developed for regulatory motif finding. Here we show how many of these algorithms, especially those that model binding specificities of regulatory factors with position specific weight matrices (WMs), naturally arise within a general Bayesian probabilistic framework. We discuss how WMs are constructed from sets of regulatory sites, how sites for a given WM can be discovered by scanning of large sequences, how to cluster WMs, and more generally how to cluster large sets of sites from different WMs into clusters. We discuss how 'regulatory modules', clusters of sites for subsets of WMs, can be found in large intergenic sequences, and we discuss different methods for <it>ab initio </it>motif finding, including expectation maximization (EM) algorithms, and motif sampling algorithms. Finally, we extensively discuss how module finding methods and <it>ab initio </it>motif finding methods can be extended to take phylogenetic relations between the input sequences into account, i.e. we show how motif finding and phylogenetic footprinting can be integrated in a rigorous probabilistic framework. The article is intended for readers with a solid background in applied mathematics, and preferably with some knowledge of general Bayesian probabilistic methods. The main purpose of the article is to elucidate that all these methods are not a disconnected set of individual algorithmic recipes, but that they are just different facets of a single integrated probabilistic theory.</p>
url	http://www.biomedcentral.com/1471-2105/8/S6/S4
work_keys_str_mv	AT vannimwegenerik findingregulatoryelementsandregulatorymotifsageneralprobabilisticframework
_version_	1725952384427884544

Finding regulatory elements and regulatory motifs: a general probabilistic framework

Similar Items