Functional discrimination of membrane proteins using machine learning techniques

<p>Abstract</p> <p>Background</p> <p>Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions,...

Full description

Bibliographic Details
Main Authors: Yabuki Yukimitsu, Gromiha M Michael
Format: Article
Language:English
Published: BMC 2008-03-01
Series:BMC Bioinformatics
Online Access:http://www.biomedcentral.com/1471-2105/9/135
id doaj-abfc402de9bc44f8a4e57431d9096a88
record_format Article
spelling doaj-abfc402de9bc44f8a4e57431d9096a882020-11-25T00:30:19ZengBMCBMC Bioinformatics1471-21052008-03-019113510.1186/1471-2105-9-135Functional discrimination of membrane proteins using machine learning techniquesYabuki YukimitsuGromiha M Michael<p>Abstract</p> <p>Background</p> <p>Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters.</p> <p>Results</p> <p>We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between other two classes of proteins. We have utilized different machine learning algorithms, such as, Bayes rule, Logistic function, Neural network, Support vector machine, Decision tree etc. for discriminating these classes of proteins. We observed that most of the algorithms have discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with the 5-fold cross validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other α-helical and β-barrel membrane proteins with the accuracy of 85% using k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane) showed the accuracy of 82%.</p> <p>Conclusion</p> <p>The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.</p> http://www.biomedcentral.com/1471-2105/9/135
collection DOAJ
language English
format Article
sources DOAJ
author Yabuki Yukimitsu
Gromiha M Michael
spellingShingle Yabuki Yukimitsu
Gromiha M Michael
Functional discrimination of membrane proteins using machine learning techniques
BMC Bioinformatics
author_facet Yabuki Yukimitsu
Gromiha M Michael
author_sort Yabuki Yukimitsu
title Functional discrimination of membrane proteins using machine learning techniques
title_short Functional discrimination of membrane proteins using machine learning techniques
title_full Functional discrimination of membrane proteins using machine learning techniques
title_fullStr Functional discrimination of membrane proteins using machine learning techniques
title_full_unstemmed Functional discrimination of membrane proteins using machine learning techniques
title_sort functional discrimination of membrane proteins using machine learning techniques
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2008-03-01
description <p>Abstract</p> <p>Background</p> <p>Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters.</p> <p>Results</p> <p>We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between other two classes of proteins. We have utilized different machine learning algorithms, such as, Bayes rule, Logistic function, Neural network, Support vector machine, Decision tree etc. for discriminating these classes of proteins. We observed that most of the algorithms have discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with the 5-fold cross validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other α-helical and β-barrel membrane proteins with the accuracy of 85% using k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane) showed the accuracy of 82%.</p> <p>Conclusion</p> <p>The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.</p>
url http://www.biomedcentral.com/1471-2105/9/135
work_keys_str_mv AT yabukiyukimitsu functionaldiscriminationofmembraneproteinsusingmachinelearningtechniques
AT gromihammichael functionaldiscriminationofmembraneproteinsusingmachinelearningtechniques
_version_ 1725327346901188608