Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network

Pathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have c...

Full description

Bibliographic Details
Main Authors: Ali Ghualm, Xiujuan Lei, Yuchen Zhang, Shi Cheng, Min Guo
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9210038/
id doaj-db49238d41ea470cb59c44f656b26011
record_format Article
spelling doaj-db49238d41ea470cb59c44f656b260112021-03-30T03:19:35ZengIEEEIEEE Access2169-35362020-01-01818014018015510.1109/ACCESS.2020.30278879210038Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural NetworkAli Ghualm0https://orcid.org/0000-0001-5166-2213Xiujuan Lei1https://orcid.org/0000-0002-9901-1732Yuchen Zhang2Shi Cheng3https://orcid.org/0000-0002-5129-995XMin Guo4School of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaPathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have caused the loss of functional PSPD. Therefore, the creation of an accurate method to predict its roles is a critical step toward human disease and pathways. In this study, we proposed a deep learning model based on a two-dimensional neural network (2D-CNN-PSPD) with a pathway-specific protein domain association prediction. In terms of the purposes of a sub-pathway, its parent pathway and its super pathway are linked to the Uni-Pathway. We also proposed a dipeptide composition (DPC) model and a dipeptide deviation (DDE) model of feature extraction profiles as PSSM. Then, we predicted the proteins associated with the same sub-pathway or with the same organism. The DDE model and DPC model of the PSSM feature profile input was associated with our proposed 2D-CNN method. We deployed several parameters to optimize the model's output performance and used the hyperparameter optimization approach to find the best model for our dataset based on the 10-fold cross-validation results. Ultimately, we assessed the predictive performance of the current model by using independent datasets and cross-validation datasets. Therefore, we enhanced the efficiency of deep learning methods. PSPD is involved in any known pathway and then follow the association in different stages of the pathway hierarchy with other proteins. Our proposed method could identify 2D-CNN-PSPD with 0.83% sensitivity, 0.92% specificity, 87.27% accuracy, and 0.75% accuracy. We provided an important method for the analysis of PSPD proteins in the proposed research, and our achievements might promote computational biological research. We concluded our proposed model architecture in the future, the use of the latest features, and the multi-one structure to predict different types of molecules, such as DNA, RNA, and disease-pathway specific proteins associations.https://ieeexplore.ieee.org/document/9210038/Molecular structure predictiondeep learningconvolutional neural networkdeep learningevolutionary knowledgemultiple features
collection DOAJ
language English
format Article
sources DOAJ
author Ali Ghualm
Xiujuan Lei
Yuchen Zhang
Shi Cheng
Min Guo
spellingShingle Ali Ghualm
Xiujuan Lei
Yuchen Zhang
Shi Cheng
Min Guo
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
IEEE Access
Molecular structure prediction
deep learning
convolutional neural network
deep learning
evolutionary knowledge
multiple features
author_facet Ali Ghualm
Xiujuan Lei
Yuchen Zhang
Shi Cheng
Min Guo
author_sort Ali Ghualm
title Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
title_short Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
title_full Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
title_fullStr Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
title_full_unstemmed Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
title_sort identification of pathway-specific protein domain by incorporating hyperparameter optimization based on 2d convolutional neural network
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Pathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have caused the loss of functional PSPD. Therefore, the creation of an accurate method to predict its roles is a critical step toward human disease and pathways. In this study, we proposed a deep learning model based on a two-dimensional neural network (2D-CNN-PSPD) with a pathway-specific protein domain association prediction. In terms of the purposes of a sub-pathway, its parent pathway and its super pathway are linked to the Uni-Pathway. We also proposed a dipeptide composition (DPC) model and a dipeptide deviation (DDE) model of feature extraction profiles as PSSM. Then, we predicted the proteins associated with the same sub-pathway or with the same organism. The DDE model and DPC model of the PSSM feature profile input was associated with our proposed 2D-CNN method. We deployed several parameters to optimize the model's output performance and used the hyperparameter optimization approach to find the best model for our dataset based on the 10-fold cross-validation results. Ultimately, we assessed the predictive performance of the current model by using independent datasets and cross-validation datasets. Therefore, we enhanced the efficiency of deep learning methods. PSPD is involved in any known pathway and then follow the association in different stages of the pathway hierarchy with other proteins. Our proposed method could identify 2D-CNN-PSPD with 0.83% sensitivity, 0.92% specificity, 87.27% accuracy, and 0.75% accuracy. We provided an important method for the analysis of PSPD proteins in the proposed research, and our achievements might promote computational biological research. We concluded our proposed model architecture in the future, the use of the latest features, and the multi-one structure to predict different types of molecules, such as DNA, RNA, and disease-pathway specific proteins associations.
topic Molecular structure prediction
deep learning
convolutional neural network
deep learning
evolutionary knowledge
multiple features
url https://ieeexplore.ieee.org/document/9210038/
work_keys_str_mv AT alighualm identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork
AT xiujuanlei identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork
AT yuchenzhang identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork
AT shicheng identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork
AT minguo identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork
_version_ 1724183694800846848