Identifying protein phosphorylation sites with kinase substrate specificity on human viruses.
Viruses infect humans and progress inside the body leading to various diseases and complications. The phosphorylation of viral proteins catalyzed by host kinases plays crucial regulatory roles in enhancing replication and inhibition of normal host-cell functions. Due to its biological importance, th...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2012-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC3402495?pdf=render |
id |
doaj-44a980114bce45f2bfaf152101e5c1d5 |
---|---|
record_format |
Article |
spelling |
doaj-44a980114bce45f2bfaf152101e5c1d52020-11-25T01:22:44ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0177e4069410.1371/journal.pone.0040694Identifying protein phosphorylation sites with kinase substrate specificity on human viruses.Neil Arvin BretañaCheng-Tsung LuChiu-Yun ChiangMin-Gang SuKai-Yao HuangTzong-Yi LeeShun-Long WengViruses infect humans and progress inside the body leading to various diseases and complications. The phosphorylation of viral proteins catalyzed by host kinases plays crucial regulatory roles in enhancing replication and inhibition of normal host-cell functions. Due to its biological importance, there is a desire to identify the protein phosphorylation sites on human viruses. However, the use of mass spectrometry-based experiments is proven to be expensive and labor-intensive. Furthermore, previous studies which have identified phosphorylation sites in human viruses do not include the investigation of the responsible kinases. Thus, we are motivated to propose a new method to identify protein phosphorylation sites with its kinase substrate specificity on human viruses. The experimentally verified phosphorylation data were extracted from virPTM--a database containing 301 experimentally verified phosphorylation data on 104 human kinase-phosphorylated virus proteins. In an attempt to investigate kinase substrate specificities in viral protein phosphorylation sites, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. The experimental human phosphorylation sites are collected from Phospho.ELM, grouped according to its kinase annotation, and compared with the virus MDD clusters. This investigation identifies human kinases such as CK2, PKB, CDK, and MAPK as potential kinases for catalyzing virus protein substrates as confirmed by published literature. Profile hidden Markov model is then applied to learn a predictive model for each subgroup. A five-fold cross validation evaluation on the MDD-clustered HMMs yields an average accuracy of 84.93% for Serine, and 78.05% for Threonine. Furthermore, an independent testing data collected from UniProtKB and Phospho.ELM is used to make a comparison of predictive performance on three popular kinase-specific phosphorylation site prediction tools. In the independent testing, the high sensitivity and specificity of the proposed method demonstrate the predictive effectiveness of the identified substrate motifs and the importance of investigating potential kinases for viral protein phosphorylation sites.http://europepmc.org/articles/PMC3402495?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Neil Arvin Bretaña Cheng-Tsung Lu Chiu-Yun Chiang Min-Gang Su Kai-Yao Huang Tzong-Yi Lee Shun-Long Weng |
spellingShingle |
Neil Arvin Bretaña Cheng-Tsung Lu Chiu-Yun Chiang Min-Gang Su Kai-Yao Huang Tzong-Yi Lee Shun-Long Weng Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. PLoS ONE |
author_facet |
Neil Arvin Bretaña Cheng-Tsung Lu Chiu-Yun Chiang Min-Gang Su Kai-Yao Huang Tzong-Yi Lee Shun-Long Weng |
author_sort |
Neil Arvin Bretaña |
title |
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
title_short |
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
title_full |
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
title_fullStr |
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
title_full_unstemmed |
Identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
title_sort |
identifying protein phosphorylation sites with kinase substrate specificity on human viruses. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2012-01-01 |
description |
Viruses infect humans and progress inside the body leading to various diseases and complications. The phosphorylation of viral proteins catalyzed by host kinases plays crucial regulatory roles in enhancing replication and inhibition of normal host-cell functions. Due to its biological importance, there is a desire to identify the protein phosphorylation sites on human viruses. However, the use of mass spectrometry-based experiments is proven to be expensive and labor-intensive. Furthermore, previous studies which have identified phosphorylation sites in human viruses do not include the investigation of the responsible kinases. Thus, we are motivated to propose a new method to identify protein phosphorylation sites with its kinase substrate specificity on human viruses. The experimentally verified phosphorylation data were extracted from virPTM--a database containing 301 experimentally verified phosphorylation data on 104 human kinase-phosphorylated virus proteins. In an attempt to investigate kinase substrate specificities in viral protein phosphorylation sites, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. The experimental human phosphorylation sites are collected from Phospho.ELM, grouped according to its kinase annotation, and compared with the virus MDD clusters. This investigation identifies human kinases such as CK2, PKB, CDK, and MAPK as potential kinases for catalyzing virus protein substrates as confirmed by published literature. Profile hidden Markov model is then applied to learn a predictive model for each subgroup. A five-fold cross validation evaluation on the MDD-clustered HMMs yields an average accuracy of 84.93% for Serine, and 78.05% for Threonine. Furthermore, an independent testing data collected from UniProtKB and Phospho.ELM is used to make a comparison of predictive performance on three popular kinase-specific phosphorylation site prediction tools. In the independent testing, the high sensitivity and specificity of the proposed method demonstrate the predictive effectiveness of the identified substrate motifs and the importance of investigating potential kinases for viral protein phosphorylation sites. |
url |
http://europepmc.org/articles/PMC3402495?pdf=render |
work_keys_str_mv |
AT neilarvinbretana identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT chengtsunglu identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT chiuyunchiang identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT mingangsu identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT kaiyaohuang identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT tzongyilee identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses AT shunlongweng identifyingproteinphosphorylationsiteswithkinasesubstratespecificityonhumanviruses |
_version_ |
1725125752939085824 |