Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier

Apoptosis proteins play a key role in maintaining the stability of organism; the functions of apoptosis proteins are related to their subcellular locations which are used to understand the mechanism of programmed cell death. In this paper, we utilize GO annotation information of apoptosis proteins a...

Full description

Bibliographic Details
Main Authors: Xiao Wang, Hui Li, Qiuwen Zhang, Rong Wang
Format: Article
Language:English
Published: Hindawi Limited 2016-01-01
Series:BioMed Research International
Online Access:http://dx.doi.org/10.1155/2016/1793272
id doaj-daab054f11374f82beef42d24b59387c
record_format Article
spelling doaj-daab054f11374f82beef42d24b59387c2020-11-24T22:18:15ZengHindawi LimitedBioMed Research International2314-61332314-61412016-01-01201610.1155/2016/17932721793272Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN ClassifierXiao Wang0Hui Li1Qiuwen Zhang2Rong Wang3School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaSchool of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaSchool of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaSchool of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, ChinaApoptosis proteins play a key role in maintaining the stability of organism; the functions of apoptosis proteins are related to their subcellular locations which are used to understand the mechanism of programmed cell death. In this paper, we utilize GO annotation information of apoptosis proteins and their homologous proteins retrieved from GOA database to formulate feature vectors and then combine the distance weighted KNN classification algorithm with them to solve the data imbalance problem existing in CL317 data set to predict subcellular locations of apoptosis proteins. It is found that the number of homologous proteins can affect the overall prediction accuracy. Under the optimal number of homologous proteins, the overall prediction accuracy of our method on CL317 data set reaches 96.8% by Jackknife test. Compared with other existing methods, it shows that our proposed method is very effective and better than others for predicting subcellular localization of apoptosis proteins.http://dx.doi.org/10.1155/2016/1793272
collection DOAJ
language English
format Article
sources DOAJ
author Xiao Wang
Hui Li
Qiuwen Zhang
Rong Wang
spellingShingle Xiao Wang
Hui Li
Qiuwen Zhang
Rong Wang
Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
BioMed Research International
author_facet Xiao Wang
Hui Li
Qiuwen Zhang
Rong Wang
author_sort Xiao Wang
title Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
title_short Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
title_full Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
title_fullStr Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
title_full_unstemmed Predicting Subcellular Localization of Apoptosis Proteins Combining GO Features of Homologous Proteins and Distance Weighted KNN Classifier
title_sort predicting subcellular localization of apoptosis proteins combining go features of homologous proteins and distance weighted knn classifier
publisher Hindawi Limited
series BioMed Research International
issn 2314-6133
2314-6141
publishDate 2016-01-01
description Apoptosis proteins play a key role in maintaining the stability of organism; the functions of apoptosis proteins are related to their subcellular locations which are used to understand the mechanism of programmed cell death. In this paper, we utilize GO annotation information of apoptosis proteins and their homologous proteins retrieved from GOA database to formulate feature vectors and then combine the distance weighted KNN classification algorithm with them to solve the data imbalance problem existing in CL317 data set to predict subcellular locations of apoptosis proteins. It is found that the number of homologous proteins can affect the overall prediction accuracy. Under the optimal number of homologous proteins, the overall prediction accuracy of our method on CL317 data set reaches 96.8% by Jackknife test. Compared with other existing methods, it shows that our proposed method is very effective and better than others for predicting subcellular localization of apoptosis proteins.
url http://dx.doi.org/10.1155/2016/1793272
work_keys_str_mv AT xiaowang predictingsubcellularlocalizationofapoptosisproteinscombininggofeaturesofhomologousproteinsanddistanceweightedknnclassifier
AT huili predictingsubcellularlocalizationofapoptosisproteinscombininggofeaturesofhomologousproteinsanddistanceweightedknnclassifier
AT qiuwenzhang predictingsubcellularlocalizationofapoptosisproteinscombininggofeaturesofhomologousproteinsanddistanceweightedknnclassifier
AT rongwang predictingsubcellularlocalizationofapoptosisproteinscombininggofeaturesofhomologousproteinsanddistanceweightedknnclassifier
_version_ 1725782710922772480