A dynamic Bayesian network approach to protein secondary structure prediction
<p>Abstract</p> <p>Background</p> <p>Protein secondary structure prediction method based on probabilistic models such as hidden Markov model (HMM) appeals to many because it provides meaningful information relevant to sequence-structure relationship. However, at present...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2008-01-01
|
Series: | BMC Bioinformatics |
Online Access: | http://www.biomedcentral.com/1471-2105/9/49 |
id |
doaj-c5e2078f49e94821a7edaa04beebb49d |
---|---|
record_format |
Article |
spelling |
doaj-c5e2078f49e94821a7edaa04beebb49d2020-11-24T21:58:28ZengBMCBMC Bioinformatics1471-21052008-01-01914910.1186/1471-2105-9-49A dynamic Bayesian network approach to protein secondary structure predictionZhu HuaiqiuYao Xin-QiuShe Zhen-Su<p>Abstract</p> <p>Background</p> <p>Protein secondary structure prediction method based on probabilistic models such as hidden Markov model (HMM) appeals to many because it provides meaningful information relevant to sequence-structure relationship. However, at present, the prediction accuracy of pure HMM-type methods is much lower than that of machine learning-based methods such as neural networks (NN) or support vector machines (SVM).</p> <p>Results</p> <p>In this paper, we report a new method of probabilistic nature for protein secondary structure prediction, based on dynamic Bayesian networks (DBN). The new method models the PSI-BLAST profile of a protein sequence using a multivariate Gaussian distribution, and simultaneously takes into account the dependency between the profile and secondary structure and the dependency between profiles of neighboring residues. In addition, a segment length distribution is introduced for each secondary structure state. Tests show that the DBN method has made a significant improvement in the accuracy compared to other pure HMM-type methods. Further improvement is achieved by combining the DBN with an NN, a method called DBNN, which shows better <it>Q</it><sub>3 </sub>accuracy than many popular methods and is competitive to the current state-of-the-arts. The most interesting feature of DBN/DBNN is that a significant improvement in the prediction accuracy is achieved when combined with other methods by a simple consensus.</p> <p>Conclusion</p> <p>The DBN method using a Gaussian distribution for the PSI-BLAST profile and a high-ordered dependency between profiles of neighboring residues produces significantly better prediction accuracy than other HMM-type probabilistic methods. Owing to their different nature, the DBN and NN combine to form a more accurate method DBNN. Future improvement may be achieved by combining DBNN with a method of SVM type.</p> http://www.biomedcentral.com/1471-2105/9/49 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhu Huaiqiu Yao Xin-Qiu She Zhen-Su |
spellingShingle |
Zhu Huaiqiu Yao Xin-Qiu She Zhen-Su A dynamic Bayesian network approach to protein secondary structure prediction BMC Bioinformatics |
author_facet |
Zhu Huaiqiu Yao Xin-Qiu She Zhen-Su |
author_sort |
Zhu Huaiqiu |
title |
A dynamic Bayesian network approach to protein secondary structure prediction |
title_short |
A dynamic Bayesian network approach to protein secondary structure prediction |
title_full |
A dynamic Bayesian network approach to protein secondary structure prediction |
title_fullStr |
A dynamic Bayesian network approach to protein secondary structure prediction |
title_full_unstemmed |
A dynamic Bayesian network approach to protein secondary structure prediction |
title_sort |
dynamic bayesian network approach to protein secondary structure prediction |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2008-01-01 |
description |
<p>Abstract</p> <p>Background</p> <p>Protein secondary structure prediction method based on probabilistic models such as hidden Markov model (HMM) appeals to many because it provides meaningful information relevant to sequence-structure relationship. However, at present, the prediction accuracy of pure HMM-type methods is much lower than that of machine learning-based methods such as neural networks (NN) or support vector machines (SVM).</p> <p>Results</p> <p>In this paper, we report a new method of probabilistic nature for protein secondary structure prediction, based on dynamic Bayesian networks (DBN). The new method models the PSI-BLAST profile of a protein sequence using a multivariate Gaussian distribution, and simultaneously takes into account the dependency between the profile and secondary structure and the dependency between profiles of neighboring residues. In addition, a segment length distribution is introduced for each secondary structure state. Tests show that the DBN method has made a significant improvement in the accuracy compared to other pure HMM-type methods. Further improvement is achieved by combining the DBN with an NN, a method called DBNN, which shows better <it>Q</it><sub>3 </sub>accuracy than many popular methods and is competitive to the current state-of-the-arts. The most interesting feature of DBN/DBNN is that a significant improvement in the prediction accuracy is achieved when combined with other methods by a simple consensus.</p> <p>Conclusion</p> <p>The DBN method using a Gaussian distribution for the PSI-BLAST profile and a high-ordered dependency between profiles of neighboring residues produces significantly better prediction accuracy than other HMM-type probabilistic methods. Owing to their different nature, the DBN and NN combine to form a more accurate method DBNN. Future improvement may be achieved by combining DBNN with a method of SVM type.</p> |
url |
http://www.biomedcentral.com/1471-2105/9/49 |
work_keys_str_mv |
AT zhuhuaiqiu adynamicbayesiannetworkapproachtoproteinsecondarystructureprediction AT yaoxinqiu adynamicbayesiannetworkapproachtoproteinsecondarystructureprediction AT shezhensu adynamicbayesiannetworkapproachtoproteinsecondarystructureprediction AT zhuhuaiqiu dynamicbayesiannetworkapproachtoproteinsecondarystructureprediction AT yaoxinqiu dynamicbayesiannetworkapproachtoproteinsecondarystructureprediction AT shezhensu dynamicbayesiannetworkapproachtoproteinsecondarystructureprediction |
_version_ |
1725851787611602944 |