A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine

碩士 === 國立中興大學 === 生物科技學研究所 === 105 === Recently, most SUMOylation prediction tools are using algorism, protein physicochemical and biochemical properties or consensus motif to predict modification sites. But these tools rarely mention the effect of other post translational modification (PTM) on sumo...

Full description

Bibliographic Details
Main Authors: Chin-Hau Tu, 凃俊豪
Other Authors: Yen-Wei Chu
Format: Others
Language:en_US
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/22455109797182811502
id ndltd-TW-105NCHU5111002
record_format oai_dc
spelling ndltd-TW-105NCHU51110022017-09-15T04:40:21Z http://ndltd.ncl.edu.tw/handle/22455109797182811502 A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine 考慮不同後修飾在離胺酸上影響的類泛素化位點預測工具 Chin-Hau Tu 凃俊豪 碩士 國立中興大學 生物科技學研究所 105 Recently, most SUMOylation prediction tools are using algorism, protein physicochemical and biochemical properties or consensus motif to predict modification sites. But these tools rarely mention the effect of other post translational modification (PTM) on sumoylation prediction. In this study, we developed a sumoylation prediction system based on machine learning approach employing SVM (support vector machine) and also updated sumoylation consensus motif and related information. In the feature coding, we encoded binary code and protein properties based on amino acid sequence. Besides, we encoded other PTM distribution as functional feature and secondary information as structure feature. We tested the prediction system that removed the post-modification distribution code and found that the prediction system had lower accuracy than the non-removed post-modification coding. Top fifty percent of feature rankings from the two feature selection methods, eighty and forty percent of all post-modification distributions were included. Those result show the influence of other post-modification sites in this study. In addition, we analyzed the number of the post-modification distributions under the central lysine and window size 21 rules, and we provided some of our findings and recommended post-modification types that could be considered. Finally, this study developed a new sumoylation prediction tool called SUMOdig. We tested SUMOdig composed of positive and randomly negative in ratio 1:1 at twenty times. The sumoylation sites with an average Matthew’s correlation coefficient is equal to 0.5114. Yen-Wei Chu 朱彥煒 2017 學位論文 ; thesis 87 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中興大學 === 生物科技學研究所 === 105 === Recently, most SUMOylation prediction tools are using algorism, protein physicochemical and biochemical properties or consensus motif to predict modification sites. But these tools rarely mention the effect of other post translational modification (PTM) on sumoylation prediction. In this study, we developed a sumoylation prediction system based on machine learning approach employing SVM (support vector machine) and also updated sumoylation consensus motif and related information. In the feature coding, we encoded binary code and protein properties based on amino acid sequence. Besides, we encoded other PTM distribution as functional feature and secondary information as structure feature. We tested the prediction system that removed the post-modification distribution code and found that the prediction system had lower accuracy than the non-removed post-modification coding. Top fifty percent of feature rankings from the two feature selection methods, eighty and forty percent of all post-modification distributions were included. Those result show the influence of other post-modification sites in this study. In addition, we analyzed the number of the post-modification distributions under the central lysine and window size 21 rules, and we provided some of our findings and recommended post-modification types that could be considered. Finally, this study developed a new sumoylation prediction tool called SUMOdig. We tested SUMOdig composed of positive and randomly negative in ratio 1:1 at twenty times. The sumoylation sites with an average Matthew’s correlation coefficient is equal to 0.5114.
author2 Yen-Wei Chu
author_facet Yen-Wei Chu
Chin-Hau Tu
凃俊豪
author Chin-Hau Tu
凃俊豪
spellingShingle Chin-Hau Tu
凃俊豪
A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
author_sort Chin-Hau Tu
title A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
title_short A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
title_full A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
title_fullStr A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
title_full_unstemmed A tool for the SUMOylation prediction by considering the effects of various post-translational modifications on lysine
title_sort tool for the sumoylation prediction by considering the effects of various post-translational modifications on lysine
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/22455109797182811502
work_keys_str_mv AT chinhautu atoolforthesumoylationpredictionbyconsideringtheeffectsofvariousposttranslationalmodificationsonlysine
AT tújùnháo atoolforthesumoylationpredictionbyconsideringtheeffectsofvariousposttranslationalmodificationsonlysine
AT chinhautu kǎolǜbùtónghòuxiūshìzàilíànsuānshàngyǐngxiǎngdelèifànsùhuàwèidiǎnyùcègōngjù
AT tújùnháo kǎolǜbùtónghòuxiūshìzàilíànsuānshàngyǐngxiǎngdelèifànsùhuàwèidiǎnyùcègōngjù
AT chinhautu toolforthesumoylationpredictionbyconsideringtheeffectsofvariousposttranslationalmodificationsonlysine
AT tújùnháo toolforthesumoylationpredictionbyconsideringtheeffectsofvariousposttranslationalmodificationsonlysine
_version_ 1718533896263434240