Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection

Membrane proteins occupy an important position in the life activities of humans and other species. The elucidation of membrane protein types provides clues for understanding the structure and function of proteins. With the fusion of various protein information including amino acid classification, ph...

Full description

Bibliographic Details
Main Authors: Lei Guo, Shunfang Wang, Zhenfeng Lei, Xueren Wang
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8539982/
id doaj-7f4e1ad267d74f05b77c488f391d20d4
record_format Article
spelling doaj-7f4e1ad267d74f05b77c488f391d20d42021-03-29T21:35:02ZengIEEEIEEE Access2169-35362018-01-016756697568110.1109/ACCESS.2018.28796358539982Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature SelectionLei Guo0Shunfang Wang1https://orcid.org/0000-0002-1927-8753Zhenfeng Lei2https://orcid.org/0000-0001-7714-1993Xueren Wang3School of Information Science and Engineering, Yunnan University, Kunming, ChinaSchool of Information Science and Engineering, Yunnan University, Kunming, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaSchool of Mathematics and Statistics, Yunnan University, Kunming, ChinaMembrane proteins occupy an important position in the life activities of humans and other species. The elucidation of membrane protein types provides clues for understanding the structure and function of proteins. With the fusion of various protein information including amino acid classification, physicochemical property, and evolutionary information, this paper proposes a system for predicting membrane protein types. In this system, a new feature selection method called MIC-GA is proposed to deal with the curse of high-dimensional features. The findings show that this approach is effective in reducing feature dimensions and improves prediction accuracy. Ensemble method based on stacked generalization is also used to solve the problem of feature heterogeneity. The performance of the present method is evaluated on two benchmark datasets. The overall prediction accuracies of eight types are 89.23% and 93.49% using jackknife test and independent test, respectively. The final experimental results show that our method is more effective than the existing methods for prediction of membrane protein types.https://ieeexplore.ieee.org/document/8539982/Prediction for membrane protein typesfusion representationMIC-GA feature selectionensemble methodstacked generalization
collection DOAJ
language English
format Article
sources DOAJ
author Lei Guo
Shunfang Wang
Zhenfeng Lei
Xueren Wang
spellingShingle Lei Guo
Shunfang Wang
Zhenfeng Lei
Xueren Wang
Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
IEEE Access
Prediction for membrane protein types
fusion representation
MIC-GA feature selection
ensemble method
stacked generalization
author_facet Lei Guo
Shunfang Wang
Zhenfeng Lei
Xueren Wang
author_sort Lei Guo
title Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
title_short Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
title_full Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
title_fullStr Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
title_full_unstemmed Prediction for Membrane Protein Types Based on Effective Fusion Representation and MIC-GA Feature Selection
title_sort prediction for membrane protein types based on effective fusion representation and mic-ga feature selection
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Membrane proteins occupy an important position in the life activities of humans and other species. The elucidation of membrane protein types provides clues for understanding the structure and function of proteins. With the fusion of various protein information including amino acid classification, physicochemical property, and evolutionary information, this paper proposes a system for predicting membrane protein types. In this system, a new feature selection method called MIC-GA is proposed to deal with the curse of high-dimensional features. The findings show that this approach is effective in reducing feature dimensions and improves prediction accuracy. Ensemble method based on stacked generalization is also used to solve the problem of feature heterogeneity. The performance of the present method is evaluated on two benchmark datasets. The overall prediction accuracies of eight types are 89.23% and 93.49% using jackknife test and independent test, respectively. The final experimental results show that our method is more effective than the existing methods for prediction of membrane protein types.
topic Prediction for membrane protein types
fusion representation
MIC-GA feature selection
ensemble method
stacked generalization
url https://ieeexplore.ieee.org/document/8539982/
work_keys_str_mv AT leiguo predictionformembraneproteintypesbasedoneffectivefusionrepresentationandmicgafeatureselection
AT shunfangwang predictionformembraneproteintypesbasedoneffectivefusionrepresentationandmicgafeatureselection
AT zhenfenglei predictionformembraneproteintypesbasedoneffectivefusionrepresentationandmicgafeatureselection
AT xuerenwang predictionformembraneproteintypesbasedoneffectivefusionrepresentationandmicgafeatureselection
_version_ 1724192659462946816