Protein function prediction by integrating sequence, structure and binding affinity information
Indiana University-Purdue University Indianapolis (IUPUI) === Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive seque...
Main Author: | |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
2014
|
Subjects: | |
Online Access: | http://hdl.handle.net/1805/3913 |
id |
ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-3913 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-39132019-05-10T15:21:25Z Protein function prediction by integrating sequence, structure and binding affinity information Zhao, Huiying Zhou, Yaoqi Liu, Yunlong Meroueh, Samy Janga, Sarath Chandra protein function Proteomics -- Data processing -- Research Proteins -- Analysis -- Mathematics -- Research Artificial intelligence Algorithms Proteins -- Structure-activity relationships Protein-protein interactions Genomics -- Data processing Proteins -- Analysis Biologically-inspired computing -- Research Expert systems (Computer science) Data mining Bioinformatics -- Research DNA-protein interactions RNA-protein interactions Carbohydrates Indiana University-Purdue University Indianapolis (IUPUI) Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive sequencing techniques dramatically speed up discovery of new protein sequences (265 million and counting) and experimental examinations of every protein in all its possible functional categories are simply impractical. Thus, it is necessary to develop computational function-prediction tools that complement and guide experimental studies. In this study, we developed a series of predictors for highly accurate prediction of proteins with DNA-binding, RNA-binding and carbohydrate-binding capability. These predictors are a template-based technique that combines sequence and structural information with predicted binding affinity. Both sequence and structure-based approaches were developed. Results indicate the importance of binding affinity prediction for improving sensitivity and precision of function prediction. Application of these methods to the human genome and structure genome targets demonstrated its usefulness in annotating proteins of unknown functions and discovering moon-lighting proteins with DNA,RNA, or carbohydrate binding function. In addition, we also investigated disruption of protein functions by naturally occurring genetic variations due to insertions and deletions (INDELS). We found that protein structures are the most critical features in recognising disease-causing non-frame shifting INDELs. The predictors for function predictions are available at http://sparks-lab.org/spot, and the predictor for classification of non-frame shifting INDELs is available at http://sparks-lab.org/ddig. 2014-02-03T18:13:42Z 2014-02-03T18:13:42Z 2014-02-03 Thesis http://hdl.handle.net/1805/3913 en_US |
collection |
NDLTD |
language |
en_US |
sources |
NDLTD |
topic |
protein function Proteomics -- Data processing -- Research Proteins -- Analysis -- Mathematics -- Research Artificial intelligence Algorithms Proteins -- Structure-activity relationships Protein-protein interactions Genomics -- Data processing Proteins -- Analysis Biologically-inspired computing -- Research Expert systems (Computer science) Data mining Bioinformatics -- Research DNA-protein interactions RNA-protein interactions Carbohydrates |
spellingShingle |
protein function Proteomics -- Data processing -- Research Proteins -- Analysis -- Mathematics -- Research Artificial intelligence Algorithms Proteins -- Structure-activity relationships Protein-protein interactions Genomics -- Data processing Proteins -- Analysis Biologically-inspired computing -- Research Expert systems (Computer science) Data mining Bioinformatics -- Research DNA-protein interactions RNA-protein interactions Carbohydrates Zhao, Huiying Protein function prediction by integrating sequence, structure and binding affinity information |
description |
Indiana University-Purdue University Indianapolis (IUPUI) === Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive sequencing techniques dramatically speed up discovery of new protein sequences (265 million and counting) and experimental examinations of every protein in all its possible functional categories are simply impractical. Thus, it is necessary to develop computational function-prediction tools that complement and guide experimental studies. In this study, we developed a series of predictors for highly accurate prediction of proteins with DNA-binding, RNA-binding and carbohydrate-binding capability. These predictors are a template-based technique that combines sequence and structural information with predicted binding affinity. Both sequence and structure-based approaches were developed. Results indicate the importance of binding affinity prediction for improving sensitivity and precision of function prediction. Application of these methods to the human genome and structure genome targets demonstrated its usefulness in annotating proteins of unknown functions and discovering moon-lighting proteins with DNA,RNA, or carbohydrate binding function. In addition, we also investigated disruption of protein functions by naturally occurring genetic variations due to insertions and deletions (INDELS). We found that protein structures are the most critical features in recognising disease-causing non-frame shifting INDELs. The predictors for function predictions are available at http://sparks-lab.org/spot, and the predictor for classification of non-frame shifting INDELs is available at http://sparks-lab.org/ddig. |
author2 |
Zhou, Yaoqi |
author_facet |
Zhou, Yaoqi Zhao, Huiying |
author |
Zhao, Huiying |
author_sort |
Zhao, Huiying |
title |
Protein function prediction by integrating sequence, structure and binding affinity information |
title_short |
Protein function prediction by integrating sequence, structure and binding affinity information |
title_full |
Protein function prediction by integrating sequence, structure and binding affinity information |
title_fullStr |
Protein function prediction by integrating sequence, structure and binding affinity information |
title_full_unstemmed |
Protein function prediction by integrating sequence, structure and binding affinity information |
title_sort |
protein function prediction by integrating sequence, structure and binding affinity information |
publishDate |
2014 |
url |
http://hdl.handle.net/1805/3913 |
work_keys_str_mv |
AT zhaohuiying proteinfunctionpredictionbyintegratingsequencestructureandbindingaffinityinformation |
_version_ |
1719080247581736960 |