Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs

碩士 === 元智大學 === 資訊工程學系 === 104 === S-sulfenylated protein, the covalent attachment of a hydroxyl group to a thiol of a cysteine amino acid, is a type of post-translational modification (PTM). This protein plays a crucial role in various biological processes including transcriptional regulation, apop...

Full description

Bibliographic Details
Main Authors: van-minh bui, 裴文明
Other Authors: Tzong-Yi Lee
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/22na83
id ndltd-TW-104YZU05392004
record_format oai_dc
spelling ndltd-TW-104YZU053920042019-05-15T22:34:37Z http://ndltd.ncl.edu.tw/handle/22na83 Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs 利用最大依賴分解度辨識硫端亞磺酰化作用位置 van-minh bui 裴文明 碩士 元智大學 資訊工程學系 104 S-sulfenylated protein, the covalent attachment of a hydroxyl group to a thiol of a cysteine amino acid, is a type of post-translational modification (PTM). This protein plays a crucial role in various biological processes including transcriptional regulation, apoptosis and cytokine signaling. However, the shortage of discoveries in S-sulfenylation protein has many limited for researchers. The issues in discrimination the substrate site of this protein consider being essential tasks in computation biology for further studies in protein structures and functions. Most of methods are used to identify this protein to be done in practical laboratories, so the cost is quite expensive to buy complex equipment and sample and the execute time takes so much time for many steps. Until now, there are no tools dedicated to the computational identification of SOH sites. With a total of 1096 experimentally S-sulfenylated proteins, the study was carried out a bioinformatics investigation on SOH sites based on amino acid composition. A TwoSampleLogo indicates that the positively and negatively charged amino acids flanking the SOH sites may impact the formulation of S-sulfenylation. Moreover, the study also invested the potential features to build the best model in order to identify effectively the cysteine residue of this protein. In addition, the maximal dependence decomposition (MDD) was utilized to explore the substrate motifs of SOH sites. Based on the concept of binary classification between SOH and non-SOH sites, Support Vector Machine (SVM) was applied to learn the predictive model from MDD-identified substrate motifs. According to the evaluation of five-fold cross validation, the integrated SVM model learned from substrate motifs yields an average accuracy of 0.87, that leads to a significant improvement in the prediction of SOH sites. Moreover, the integrated SVM model also efficaciously enhances the performance in an independent testing set. Finally, the integrated SVM model was applied to implement an effective web resource, named MDD-SOH, for identifying SOH sites with their corresponding substrate motifs (http://csb.cse.yzu.edu.tw/MDDSOH/index.php). Tzong-Yi Lee 李宗夷 2016 學位論文 ; thesis 44 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊工程學系 === 104 === S-sulfenylated protein, the covalent attachment of a hydroxyl group to a thiol of a cysteine amino acid, is a type of post-translational modification (PTM). This protein plays a crucial role in various biological processes including transcriptional regulation, apoptosis and cytokine signaling. However, the shortage of discoveries in S-sulfenylation protein has many limited for researchers. The issues in discrimination the substrate site of this protein consider being essential tasks in computation biology for further studies in protein structures and functions. Most of methods are used to identify this protein to be done in practical laboratories, so the cost is quite expensive to buy complex equipment and sample and the execute time takes so much time for many steps. Until now, there are no tools dedicated to the computational identification of SOH sites. With a total of 1096 experimentally S-sulfenylated proteins, the study was carried out a bioinformatics investigation on SOH sites based on amino acid composition. A TwoSampleLogo indicates that the positively and negatively charged amino acids flanking the SOH sites may impact the formulation of S-sulfenylation. Moreover, the study also invested the potential features to build the best model in order to identify effectively the cysteine residue of this protein. In addition, the maximal dependence decomposition (MDD) was utilized to explore the substrate motifs of SOH sites. Based on the concept of binary classification between SOH and non-SOH sites, Support Vector Machine (SVM) was applied to learn the predictive model from MDD-identified substrate motifs. According to the evaluation of five-fold cross validation, the integrated SVM model learned from substrate motifs yields an average accuracy of 0.87, that leads to a significant improvement in the prediction of SOH sites. Moreover, the integrated SVM model also efficaciously enhances the performance in an independent testing set. Finally, the integrated SVM model was applied to implement an effective web resource, named MDD-SOH, for identifying SOH sites with their corresponding substrate motifs (http://csb.cse.yzu.edu.tw/MDDSOH/index.php).
author2 Tzong-Yi Lee
author_facet Tzong-Yi Lee
van-minh bui
裴文明
author van-minh bui
裴文明
spellingShingle van-minh bui
裴文明
Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
author_sort van-minh bui
title Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
title_short Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
title_full Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
title_fullStr Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
title_full_unstemmed Exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs
title_sort exploiting maximal dependence decomposition to identify s-sulfenylation sites with substrate motifs
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/22na83
work_keys_str_mv AT vanminhbui exploitingmaximaldependencedecompositiontoidentifyssulfenylationsiteswithsubstratemotifs
AT péiwénmíng exploitingmaximaldependencedecompositiontoidentifyssulfenylationsiteswithsubstratemotifs
AT vanminhbui lìyòngzuìdàyīlàifēnjiědùbiànshíliúduānyàhuángxiānhuàzuòyòngwèizhì
AT péiwénmíng lìyòngzuìdàyīlàifēnjiědùbiànshíliúduānyàhuángxiānhuàzuòyòngwèizhì
_version_ 1719132990015012864