Multi-label multi-kernel transfer learning for human protein subcellular localization.

Recent years have witnessed much progress in computational modelling for protein subcellular localization. However, the existing sequence-based predictive models demonstrate moderate or unsatisfactory performance, and the gene ontology (GO) based models may take the risk of performance overestimatio...

Full description

Bibliographic Details
Main Author:	Suyu Mei
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2012-01-01
Series:	PLoS ONE
Online Access:	http://europepmc.org/articles/PMC3374840?pdf=render

id	doaj-c9b73c9d88ef46669fba84fafb7f543c
record_format	Article
spelling	doaj-c9b73c9d88ef46669fba84fafb7f543c2020-11-25T01:56:02ZengPublic Library of Science (PLoS)PLoS ONE1932-62032012-01-0176e3771610.1371/journal.pone.0037716Multi-label multi-kernel transfer learning for human protein subcellular localization.Suyu MeiRecent years have witnessed much progress in computational modelling for protein subcellular localization. However, the existing sequence-based predictive models demonstrate moderate or unsatisfactory performance, and the gene ontology (GO) based models may take the risk of performance overestimation for novel proteins. Furthermore, many human proteins have multiple subcellular locations, which renders the computational modelling more complicated. Up to the present, there are far few researches specialized for predicting the subcellular localization of human proteins that may reside in multiple cellular compartments. In this paper, we propose a multi-label multi-kernel transfer learning model for human protein subcellular localization (MLMK-TLM). MLMK-TLM proposes a multi-label confusion matrix, formally formulates three multi-labelling performance measures and adapts one-against-all multi-class probabilistic outputs to multi-label learning scenario, based on which to further extends our published work GO-TLM (gene ontology based transfer learning model for protein subcellular localization) and MK-TLM (multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization) for multiplex human protein subcellular localization. With the advantages of proper homolog knowledge transfer, comprehensive survey of model performance for novel protein and multi-labelling capability, MLMK-TLM will gain more practical applicability. The experiments on human protein benchmark dataset show that MLMK-TLM significantly outperforms the baseline model and demonstrates good multi-labelling ability for novel human proteins. Some findings (predictions) are validated by the latest Swiss-Prot database. The software can be freely downloaded at http://soft.synu.edu.cn/upload/msy.rar.http://europepmc.org/articles/PMC3374840?pdf=render
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Suyu Mei
spellingShingle	Suyu Mei Multi-label multi-kernel transfer learning for human protein subcellular localization. PLoS ONE
author_facet	Suyu Mei
author_sort	Suyu Mei
title	Multi-label multi-kernel transfer learning for human protein subcellular localization.
title_short	Multi-label multi-kernel transfer learning for human protein subcellular localization.
title_full	Multi-label multi-kernel transfer learning for human protein subcellular localization.
title_fullStr	Multi-label multi-kernel transfer learning for human protein subcellular localization.
title_full_unstemmed	Multi-label multi-kernel transfer learning for human protein subcellular localization.
title_sort	multi-label multi-kernel transfer learning for human protein subcellular localization.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2012-01-01
description	Recent years have witnessed much progress in computational modelling for protein subcellular localization. However, the existing sequence-based predictive models demonstrate moderate or unsatisfactory performance, and the gene ontology (GO) based models may take the risk of performance overestimation for novel proteins. Furthermore, many human proteins have multiple subcellular locations, which renders the computational modelling more complicated. Up to the present, there are far few researches specialized for predicting the subcellular localization of human proteins that may reside in multiple cellular compartments. In this paper, we propose a multi-label multi-kernel transfer learning model for human protein subcellular localization (MLMK-TLM). MLMK-TLM proposes a multi-label confusion matrix, formally formulates three multi-labelling performance measures and adapts one-against-all multi-class probabilistic outputs to multi-label learning scenario, based on which to further extends our published work GO-TLM (gene ontology based transfer learning model for protein subcellular localization) and MK-TLM (multi-kernel transfer learning based on Chou's PseAAC formulation for protein submitochondria localization) for multiplex human protein subcellular localization. With the advantages of proper homolog knowledge transfer, comprehensive survey of model performance for novel protein and multi-labelling capability, MLMK-TLM will gain more practical applicability. The experiments on human protein benchmark dataset show that MLMK-TLM significantly outperforms the baseline model and demonstrates good multi-labelling ability for novel human proteins. Some findings (predictions) are validated by the latest Swiss-Prot database. The software can be freely downloaded at http://soft.synu.edu.cn/upload/msy.rar.
url	http://europepmc.org/articles/PMC3374840?pdf=render
work_keys_str_mv	AT suyumei multilabelmultikerneltransferlearningforhumanproteinsubcellularlocalization
_version_	1724982006907928576

Multi-label multi-kernel transfer learning for human protein subcellular localization.

Similar Items