WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems
碩士 === 元智大學 === 資訊工程學系 === 100 === RNA-protein interactions are most important intracellular biological processes, and there are essential for understanding mechanisms of various life activities within the cell. Specially, RNA-binding proteins (RBPs) play an important role in the RNA-protein interac...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Online Access: | http://ndltd.ncl.edu.tw/handle/20931316345810791072 |
id |
ndltd-TW-100YZU05392026 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100YZU053920262015-10-13T21:33:10Z http://ndltd.ncl.edu.tw/handle/20931316345810791072 WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems WildRBR: 從蛋白質序列預測核糖核酸結合殘基之整合型票選系統預測器 Tai-Wei Chao 趙大緯 碩士 元智大學 資訊工程學系 100 RNA-protein interactions are most important intracellular biological processes, and there are essential for understanding mechanisms of various life activities within the cell. Specially, RNA-binding proteins (RBPs) play an important role in the RNA-protein interactions. Identification of RNA-binding residues (RBRs) in proteins can provide valuable insights for biologists. In the absence of structures for RNA-protein complexes; it is strongly desirable to predict RBRs by protein sequences alone. In this thesis, we present an integrated predictor with voting system named WildRBR to tackle this problem, which combines co-conserved motifs discovered by WildSpan with four best predictors as we have known, including BindN, PPRint, PRBR and PiRaNhA for identifying RBRs in protein sequences using several combinations and voting methods. We compare PRBR, PiRaNhA, PPRint, WildSpan and BindN with WildRBR based on 170 dataset (total of 170 RNA-binding proteins complexes), they achieved Matthew’s correlation coefficients (MCC)/F-score/Specificity/Sensitivity are 0.341/0.433/0.903/0.424、0.587/0.776/0.821/0.894、0.480/0.686/0.794/0.795、0.190/0.362/0.827/0.378、0.157/0.368/0.778/0.403 and 0.615/0.739/0.995/0.973, respectively. In conclusion, the predictive performance of WildRBR is better than other four predictors, it has highest sensitivity and accuracy. Conclusively, the efficiency of WildRBR not only is favorable in predicting RBRs from complex-structure-unknown protein but also can applied in predicting RNA-binding regions of proteins and identifying RNA-binding proteins. Finally, the release of WildRBR stand-alone program is largely desired in large-scale proteomics. 劉寶鈞 學位論文 ; thesis 54 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊工程學系 === 100 === RNA-protein interactions are most important intracellular biological processes, and there are essential for understanding mechanisms of various life activities within the cell. Specially, RNA-binding proteins (RBPs) play an important role in the RNA-protein interactions. Identification of RNA-binding residues (RBRs) in proteins can provide valuable insights for biologists. In the absence of structures for RNA-protein complexes; it is strongly desirable to predict RBRs by protein sequences alone. In this thesis, we present an integrated predictor with voting system named WildRBR to tackle this problem, which combines co-conserved motifs discovered by WildSpan with four best predictors as we have known, including BindN, PPRint, PRBR and PiRaNhA for identifying RBRs in protein sequences using several combinations and voting methods. We compare PRBR, PiRaNhA, PPRint, WildSpan and BindN with WildRBR based on 170 dataset (total of 170 RNA-binding proteins complexes), they achieved Matthew’s correlation coefficients (MCC)/F-score/Specificity/Sensitivity are 0.341/0.433/0.903/0.424、0.587/0.776/0.821/0.894、0.480/0.686/0.794/0.795、0.190/0.362/0.827/0.378、0.157/0.368/0.778/0.403 and 0.615/0.739/0.995/0.973, respectively. In conclusion, the predictive performance of WildRBR is better than other four predictors, it has highest sensitivity and accuracy. Conclusively, the efficiency of WildRBR not only is favorable in predicting RBRs from complex-structure-unknown protein but also can applied in predicting RNA-binding regions of proteins and identifying RNA-binding proteins. Finally, the release of WildRBR stand-alone program is largely desired in large-scale proteomics.
|
author2 |
劉寶鈞 |
author_facet |
劉寶鈞 Tai-Wei Chao 趙大緯 |
author |
Tai-Wei Chao 趙大緯 |
spellingShingle |
Tai-Wei Chao 趙大緯 WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
author_sort |
Tai-Wei Chao |
title |
WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
title_short |
WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
title_full |
WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
title_fullStr |
WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
title_full_unstemmed |
WildRBR: Integrated Prediction of RNA-binding Residues in a Protein Sequence by Voting Systems |
title_sort |
wildrbr: integrated prediction of rna-binding residues in a protein sequence by voting systems |
url |
http://ndltd.ncl.edu.tw/handle/20931316345810791072 |
work_keys_str_mv |
AT taiweichao wildrbrintegratedpredictionofrnabindingresiduesinaproteinsequencebyvotingsystems AT zhàodàwěi wildrbrintegratedpredictionofrnabindingresiduesinaproteinsequencebyvotingsystems AT taiweichao wildrbrcóngdànbáizhìxùlièyùcèhétánghésuānjiéhécánjīzhīzhěnghéxíngpiàoxuǎnxìtǒngyùcèqì AT zhàodàwěi wildrbrcóngdànbáizhìxùlièyùcèhétánghésuānjiéhécánjīzhīzhěnghéxíngpiàoxuǎnxìtǒngyùcèqì |
_version_ |
1718066281687547904 |