Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach

Identifying transcription factor binding sites with experimental methods is often expensive and time consuming. Although many computational approaches and tools have been developed for this problem, the prediction accuracy is not satisfactory. In this paper, we develop a new computational approach t...

Full description

Bibliographic Details
Main Authors: Jia Song, Li Xu, Hong Sun
Format: Article
Language:English
Published: Hindawi Limited 2013-01-01
Series:Computational and Mathematical Methods in Medicine
Online Access:http://dx.doi.org/10.1155/2013/856281
id doaj-c2ca16c807b3465aa1cd23298f5fbcd3
record_format Article
spelling doaj-c2ca16c807b3465aa1cd23298f5fbcd32020-11-25T00:14:37ZengHindawi LimitedComputational and Mathematical Methods in Medicine1748-670X1748-67182013-01-01201310.1155/2013/856281856281Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic ApproachJia Song0Li Xu1Hong Sun2College of Electrical Engineering, Zhejiang University, Hangzhou 310027, ChinaCollege of Electrical Engineering, Zhejiang University, Hangzhou 310027, ChinaDepartment of Electronic and Information Technology, Suzhou Vocational University, Suzhou 215104, ChinaIdentifying transcription factor binding sites with experimental methods is often expensive and time consuming. Although many computational approaches and tools have been developed for this problem, the prediction accuracy is not satisfactory. In this paper, we develop a new computational approach that can model the relationships among all short sequence segments in the promoter regions with a graph theoretic model. Based on this model, finding the locations of transcription factor binding site is reduced to computing maximum weighted cliques in a graph with weighted edges. We have implemented this approach and used it to predict the binding sites in two organisms, Caenorhabditis elegans and mus musculus. We compared the prediction accuracy with that of the Gibbs Motif Sampler. We found that the accuracy of our approach is higher than or comparable with that of the Gibbs Motif Sampler for most of tested data and can accurately identify binding sites in cases where the Gibbs Motif Sampler has difficulty to predict their locations.http://dx.doi.org/10.1155/2013/856281
collection DOAJ
language English
format Article
sources DOAJ
author Jia Song
Li Xu
Hong Sun
spellingShingle Jia Song
Li Xu
Hong Sun
Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
Computational and Mathematical Methods in Medicine
author_facet Jia Song
Li Xu
Hong Sun
author_sort Jia Song
title Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
title_short Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
title_full Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
title_fullStr Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
title_full_unstemmed Efficient Identification of Transcription Factor Binding Sites with a Graph Theoretic Approach
title_sort efficient identification of transcription factor binding sites with a graph theoretic approach
publisher Hindawi Limited
series Computational and Mathematical Methods in Medicine
issn 1748-670X
1748-6718
publishDate 2013-01-01
description Identifying transcription factor binding sites with experimental methods is often expensive and time consuming. Although many computational approaches and tools have been developed for this problem, the prediction accuracy is not satisfactory. In this paper, we develop a new computational approach that can model the relationships among all short sequence segments in the promoter regions with a graph theoretic model. Based on this model, finding the locations of transcription factor binding site is reduced to computing maximum weighted cliques in a graph with weighted edges. We have implemented this approach and used it to predict the binding sites in two organisms, Caenorhabditis elegans and mus musculus. We compared the prediction accuracy with that of the Gibbs Motif Sampler. We found that the accuracy of our approach is higher than or comparable with that of the Gibbs Motif Sampler for most of tested data and can accurately identify binding sites in cases where the Gibbs Motif Sampler has difficulty to predict their locations.
url http://dx.doi.org/10.1155/2013/856281
work_keys_str_mv AT jiasong efficientidentificationoftranscriptionfactorbindingsiteswithagraphtheoreticapproach
AT lixu efficientidentificationoftranscriptionfactorbindingsiteswithagraphtheoreticapproach
AT hongsun efficientidentificationoftranscriptionfactorbindingsiteswithagraphtheoreticapproach
_version_ 1725389687820910592