Prediction of protein thermostability using Decision Tree base on sequence and structure features
碩士 === 國立中央大學 === 資訊工程研究所 === 93 === The protein thermostability information is closely related to production of many biomaterials. Recent developments in research on the proteins thermostability find out the significant features for thermal stability of protein according to comparisons between homo...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/57058783926064390275 |
id |
ndltd-TW-093NCU05392076 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NCU053920762015-10-13T11:53:59Z http://ndltd.ncl.edu.tw/handle/57058783926064390275 Prediction of protein thermostability using Decision Tree base on sequence and structure features 利用決策樹以蛋白質序列及結構預測熱穩定性 Jian-Sin Li 李見信 碩士 國立中央大學 資訊工程研究所 93 The protein thermostability information is closely related to production of many biomaterials. Recent developments in research on the proteins thermostability find out the significant features for thermal stability of protein according to comparisons between homologous proteins. The amino acid composition, special pattern in sequence information and hydrogen bond, disulfide bond, salt bridges and so on in protein structure are considered important for thermostability. In this study, we present a system to integrate various factors to predict protein thermostability. In our research, a large number of proteins are from PGTdb and PDB. To start with, fetch out various features form sequences and structures. Then, feature selection algorithm is used to filter the features that have higher linear correlation coefficient to thermostability. Lastly, we apply these features to machine learning approach to built a predict system. In this research we discover two features, i.e., (E+F+M+R)/residue and charged/noncharged have linear correlation to thermostability. We finally establish two predict systems, one can predict protein thermostability by inputting protein sequences only, and the other can get better performance if the protein structure is known. Jorng-Tzong Horng 洪炯宗 2005 學位論文 ; thesis 50 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊工程研究所 === 93 === The protein thermostability information is closely related to production of many biomaterials. Recent developments in research on the proteins thermostability find out the significant features for thermal stability of protein according to comparisons between homologous proteins. The amino acid composition, special pattern in sequence information and hydrogen bond, disulfide bond, salt bridges and so on in protein structure are considered important for thermostability. In this study, we present a system to integrate various factors to predict protein thermostability. In our research, a large number of proteins are from PGTdb and PDB. To start with, fetch out various features form sequences and structures. Then, feature selection algorithm is used to filter the features that have higher linear correlation coefficient to thermostability. Lastly, we apply these features to machine learning approach to built a predict system. In this research we discover two features, i.e., (E+F+M+R)/residue and charged/noncharged have linear correlation to thermostability. We finally establish two predict systems, one can predict protein thermostability by inputting protein sequences only, and the other can get better performance if the protein structure is known.
|
author2 |
Jorng-Tzong Horng |
author_facet |
Jorng-Tzong Horng Jian-Sin Li 李見信 |
author |
Jian-Sin Li 李見信 |
spellingShingle |
Jian-Sin Li 李見信 Prediction of protein thermostability using Decision Tree base on sequence and structure features |
author_sort |
Jian-Sin Li |
title |
Prediction of protein thermostability using Decision Tree base on sequence and structure features |
title_short |
Prediction of protein thermostability using Decision Tree base on sequence and structure features |
title_full |
Prediction of protein thermostability using Decision Tree base on sequence and structure features |
title_fullStr |
Prediction of protein thermostability using Decision Tree base on sequence and structure features |
title_full_unstemmed |
Prediction of protein thermostability using Decision Tree base on sequence and structure features |
title_sort |
prediction of protein thermostability using decision tree base on sequence and structure features |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/57058783926064390275 |
work_keys_str_mv |
AT jiansinli predictionofproteinthermostabilityusingdecisiontreebaseonsequenceandstructurefeatures AT lǐjiànxìn predictionofproteinthermostabilityusingdecisiontreebaseonsequenceandstructurefeatures AT jiansinli lìyòngjuécèshùyǐdànbáizhìxùlièjíjiégòuyùcèrèwěndìngxìng AT lǐjiànxìn lìyòngjuécèshùyǐdànbáizhìxùlièjíjiégòuyùcèrèwěndìngxìng |
_version_ |
1716850503188480000 |