Protein function prediction by integrating sequence, structure and binding affinity information

Indiana University-Purdue University Indianapolis (IUPUI) === Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive seque...

Full description

Bibliographic Details
Main Author: Zhao, Huiying
Other Authors: Zhou, Yaoqi
Language:en_US
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/1805/3913
id ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-3913
record_format oai_dc
spelling ndltd-IUPUI-oai-scholarworks.iupui.edu-1805-39132019-05-10T15:21:25Z Protein function prediction by integrating sequence, structure and binding affinity information Zhao, Huiying Zhou, Yaoqi Liu, Yunlong Meroueh, Samy Janga, Sarath Chandra protein function Proteomics -- Data processing -- Research Proteins -- Analysis -- Mathematics -- Research Artificial intelligence Algorithms Proteins -- Structure-activity relationships Protein-protein interactions Genomics -- Data processing Proteins -- Analysis Biologically-inspired computing -- Research Expert systems (Computer science) Data mining Bioinformatics -- Research DNA-protein interactions RNA-protein interactions Carbohydrates Indiana University-Purdue University Indianapolis (IUPUI) Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive sequencing techniques dramatically speed up discovery of new protein sequences (265 million and counting) and experimental examinations of every protein in all its possible functional categories are simply impractical. Thus, it is necessary to develop computational function-prediction tools that complement and guide experimental studies. In this study, we developed a series of predictors for highly accurate prediction of proteins with DNA-binding, RNA-binding and carbohydrate-binding capability. These predictors are a template-based technique that combines sequence and structural information with predicted binding affinity. Both sequence and structure-based approaches were developed. Results indicate the importance of binding affinity prediction for improving sensitivity and precision of function prediction. Application of these methods to the human genome and structure genome targets demonstrated its usefulness in annotating proteins of unknown functions and discovering moon-lighting proteins with DNA,RNA, or carbohydrate binding function. In addition, we also investigated disruption of protein functions by naturally occurring genetic variations due to insertions and deletions (INDELS). We found that protein structures are the most critical features in recognising disease-causing non-frame shifting INDELs. The predictors for function predictions are available at http://sparks-lab.org/spot, and the predictor for classification of non-frame shifting INDELs is available at http://sparks-lab.org/ddig. 2014-02-03T18:13:42Z 2014-02-03T18:13:42Z 2014-02-03 Thesis http://hdl.handle.net/1805/3913 en_US
collection NDLTD
language en_US
sources NDLTD
topic protein function
Proteomics -- Data processing -- Research
Proteins -- Analysis -- Mathematics -- Research
Artificial intelligence
Algorithms
Proteins -- Structure-activity relationships
Protein-protein interactions
Genomics -- Data processing
Proteins -- Analysis
Biologically-inspired computing -- Research
Expert systems (Computer science)
Data mining
Bioinformatics -- Research
DNA-protein interactions
RNA-protein interactions
Carbohydrates
spellingShingle protein function
Proteomics -- Data processing -- Research
Proteins -- Analysis -- Mathematics -- Research
Artificial intelligence
Algorithms
Proteins -- Structure-activity relationships
Protein-protein interactions
Genomics -- Data processing
Proteins -- Analysis
Biologically-inspired computing -- Research
Expert systems (Computer science)
Data mining
Bioinformatics -- Research
DNA-protein interactions
RNA-protein interactions
Carbohydrates
Zhao, Huiying
Protein function prediction by integrating sequence, structure and binding affinity information
description Indiana University-Purdue University Indianapolis (IUPUI) === Proteins are nano-machines that work inside every living organism. Functional disruption of one or several proteins is the cause for many diseases. However, the functions for most proteins are yet to be annotated because inexpensive sequencing techniques dramatically speed up discovery of new protein sequences (265 million and counting) and experimental examinations of every protein in all its possible functional categories are simply impractical. Thus, it is necessary to develop computational function-prediction tools that complement and guide experimental studies. In this study, we developed a series of predictors for highly accurate prediction of proteins with DNA-binding, RNA-binding and carbohydrate-binding capability. These predictors are a template-based technique that combines sequence and structural information with predicted binding affinity. Both sequence and structure-based approaches were developed. Results indicate the importance of binding affinity prediction for improving sensitivity and precision of function prediction. Application of these methods to the human genome and structure genome targets demonstrated its usefulness in annotating proteins of unknown functions and discovering moon-lighting proteins with DNA,RNA, or carbohydrate binding function. In addition, we also investigated disruption of protein functions by naturally occurring genetic variations due to insertions and deletions (INDELS). We found that protein structures are the most critical features in recognising disease-causing non-frame shifting INDELs. The predictors for function predictions are available at http://sparks-lab.org/spot, and the predictor for classification of non-frame shifting INDELs is available at http://sparks-lab.org/ddig.
author2 Zhou, Yaoqi
author_facet Zhou, Yaoqi
Zhao, Huiying
author Zhao, Huiying
author_sort Zhao, Huiying
title Protein function prediction by integrating sequence, structure and binding affinity information
title_short Protein function prediction by integrating sequence, structure and binding affinity information
title_full Protein function prediction by integrating sequence, structure and binding affinity information
title_fullStr Protein function prediction by integrating sequence, structure and binding affinity information
title_full_unstemmed Protein function prediction by integrating sequence, structure and binding affinity information
title_sort protein function prediction by integrating sequence, structure and binding affinity information
publishDate 2014
url http://hdl.handle.net/1805/3913
work_keys_str_mv AT zhaohuiying proteinfunctionpredictionbyintegratingsequencestructureandbindingaffinityinformation
_version_ 1719080247581736960