數據驅動的蛋白質對接

博士 === 國立清華大學 === 生物資訊與結構生物研究所 === 100 === Protein-protein docking (PPD), a computational method for predicting the structure of a protein complex from known component structures, has the ability to reveal atomic details of the complex structure and make otherwise unattainable discoveries. Among dif...

Full description

Bibliographic Details
Main Authors: Shih, Edward S.C., 施宣誠
Other Authors: Hwang, Ming-Jing
Format: Others
Language:en_US
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/88750082076141307800
id ndltd-TW-100NTHU5112009
record_format oai_dc
spelling ndltd-TW-100NTHU51120092016-04-04T04:17:09Z http://ndltd.ncl.edu.tw/handle/88750082076141307800 數據驅動的蛋白質對接 Data-driven Protein-Protein Docking Shih, Edward S.C. 施宣誠 博士 國立清華大學 生物資訊與結構生物研究所 100 Protein-protein docking (PPD), a computational method for predicting the structure of a protein complex from known component structures, has the ability to reveal atomic details of the complex structure and make otherwise unattainable discoveries. Among different kinds of PPD methods developed, one conclusion from monitoring their progress over the years is that data-driven PPD is particularly useful because it can significantly increase the low success rates of PPD, albeit requiring the incorporation of experimental data or information as constraints for the docking. It is unclear, however, how vulnerable of data-driven PPD is to incorrect or ambiguous data, answers to which will provide guides for practical applications of using PPD to computationally predict protein complex structures. In this work, we aim to characterize the effects of using constraints, i.e. the amount and quality of experimental data or information, on the accuracies of data-driven PPD predictions. There are two main types of data that can be obtained from biochemical and biophysical experiments, or from bioinformatics predictions: 1) distance between two specific atoms, and 2) information of interface residues, to aid PPD predictions. Our results showed that only few distances, even with a significant amount of noises in the data, can greatly improve the performance of PPD from those of conventional PPD predictions. In comparison, a larger amount and better quality of interface residue information is needed for achieving high success rates in data-driven PPD predictions. The results from this study provide some needed guidelines for using data-driven PPD to predict protein complex structures, and point out directions for future research in this field. Hwang, Ming-Jing Lyu, Ping-Chiang 黃明經 呂平江 2012 學位論文 ; thesis 139 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立清華大學 === 生物資訊與結構生物研究所 === 100 === Protein-protein docking (PPD), a computational method for predicting the structure of a protein complex from known component structures, has the ability to reveal atomic details of the complex structure and make otherwise unattainable discoveries. Among different kinds of PPD methods developed, one conclusion from monitoring their progress over the years is that data-driven PPD is particularly useful because it can significantly increase the low success rates of PPD, albeit requiring the incorporation of experimental data or information as constraints for the docking. It is unclear, however, how vulnerable of data-driven PPD is to incorrect or ambiguous data, answers to which will provide guides for practical applications of using PPD to computationally predict protein complex structures. In this work, we aim to characterize the effects of using constraints, i.e. the amount and quality of experimental data or information, on the accuracies of data-driven PPD predictions. There are two main types of data that can be obtained from biochemical and biophysical experiments, or from bioinformatics predictions: 1) distance between two specific atoms, and 2) information of interface residues, to aid PPD predictions. Our results showed that only few distances, even with a significant amount of noises in the data, can greatly improve the performance of PPD from those of conventional PPD predictions. In comparison, a larger amount and better quality of interface residue information is needed for achieving high success rates in data-driven PPD predictions. The results from this study provide some needed guidelines for using data-driven PPD to predict protein complex structures, and point out directions for future research in this field.
author2 Hwang, Ming-Jing
author_facet Hwang, Ming-Jing
Shih, Edward S.C.
施宣誠
author Shih, Edward S.C.
施宣誠
spellingShingle Shih, Edward S.C.
施宣誠
數據驅動的蛋白質對接
author_sort Shih, Edward S.C.
title 數據驅動的蛋白質對接
title_short 數據驅動的蛋白質對接
title_full 數據驅動的蛋白質對接
title_fullStr 數據驅動的蛋白質對接
title_full_unstemmed 數據驅動的蛋白質對接
title_sort 數據驅動的蛋白質對接
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/88750082076141307800
work_keys_str_mv AT shihedwardsc shùjùqūdòngdedànbáizhìduìjiē
AT shīxuānchéng shùjùqūdòngdedànbáizhìduìjiē
AT shihedwardsc datadrivenproteinproteindocking
AT shīxuānchéng datadrivenproteinproteindocking
_version_ 1718214604609290240