Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network
Pathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have c...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9210038/ |
id |
doaj-db49238d41ea470cb59c44f656b26011 |
---|---|
record_format |
Article |
spelling |
doaj-db49238d41ea470cb59c44f656b260112021-03-30T03:19:35ZengIEEEIEEE Access2169-35362020-01-01818014018015510.1109/ACCESS.2020.30278879210038Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural NetworkAli Ghualm0https://orcid.org/0000-0001-5166-2213Xiujuan Lei1https://orcid.org/0000-0002-9901-1732Yuchen Zhang2Shi Cheng3https://orcid.org/0000-0002-5129-995XMin Guo4School of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaSchool of Computer Science, Shaanxi Normal University, Xi’an, ChinaPathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have caused the loss of functional PSPD. Therefore, the creation of an accurate method to predict its roles is a critical step toward human disease and pathways. In this study, we proposed a deep learning model based on a two-dimensional neural network (2D-CNN-PSPD) with a pathway-specific protein domain association prediction. In terms of the purposes of a sub-pathway, its parent pathway and its super pathway are linked to the Uni-Pathway. We also proposed a dipeptide composition (DPC) model and a dipeptide deviation (DDE) model of feature extraction profiles as PSSM. Then, we predicted the proteins associated with the same sub-pathway or with the same organism. The DDE model and DPC model of the PSSM feature profile input was associated with our proposed 2D-CNN method. We deployed several parameters to optimize the model's output performance and used the hyperparameter optimization approach to find the best model for our dataset based on the 10-fold cross-validation results. Ultimately, we assessed the predictive performance of the current model by using independent datasets and cross-validation datasets. Therefore, we enhanced the efficiency of deep learning methods. PSPD is involved in any known pathway and then follow the association in different stages of the pathway hierarchy with other proteins. Our proposed method could identify 2D-CNN-PSPD with 0.83% sensitivity, 0.92% specificity, 87.27% accuracy, and 0.75% accuracy. We provided an important method for the analysis of PSPD proteins in the proposed research, and our achievements might promote computational biological research. We concluded our proposed model architecture in the future, the use of the latest features, and the multi-one structure to predict different types of molecules, such as DNA, RNA, and disease-pathway specific proteins associations.https://ieeexplore.ieee.org/document/9210038/Molecular structure predictiondeep learningconvolutional neural networkdeep learningevolutionary knowledgemultiple features |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ali Ghualm Xiujuan Lei Yuchen Zhang Shi Cheng Min Guo |
spellingShingle |
Ali Ghualm Xiujuan Lei Yuchen Zhang Shi Cheng Min Guo Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network IEEE Access Molecular structure prediction deep learning convolutional neural network deep learning evolutionary knowledge multiple features |
author_facet |
Ali Ghualm Xiujuan Lei Yuchen Zhang Shi Cheng Min Guo |
author_sort |
Ali Ghualm |
title |
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network |
title_short |
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network |
title_full |
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network |
title_fullStr |
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network |
title_full_unstemmed |
Identification of Pathway-Specific Protein Domain by Incorporating Hyperparameter Optimization Based on 2D Convolutional Neural Network |
title_sort |
identification of pathway-specific protein domain by incorporating hyperparameter optimization based on 2d convolutional neural network |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Pathway-specific protein domain (PSPD) are associated with specific pathways. Many protein domains are pervasive in various biological processes, whereas other domains are linked to specific pathways. Many human disease pathways, such as cancer pathways and signaling pathway-related diseases, have caused the loss of functional PSPD. Therefore, the creation of an accurate method to predict its roles is a critical step toward human disease and pathways. In this study, we proposed a deep learning model based on a two-dimensional neural network (2D-CNN-PSPD) with a pathway-specific protein domain association prediction. In terms of the purposes of a sub-pathway, its parent pathway and its super pathway are linked to the Uni-Pathway. We also proposed a dipeptide composition (DPC) model and a dipeptide deviation (DDE) model of feature extraction profiles as PSSM. Then, we predicted the proteins associated with the same sub-pathway or with the same organism. The DDE model and DPC model of the PSSM feature profile input was associated with our proposed 2D-CNN method. We deployed several parameters to optimize the model's output performance and used the hyperparameter optimization approach to find the best model for our dataset based on the 10-fold cross-validation results. Ultimately, we assessed the predictive performance of the current model by using independent datasets and cross-validation datasets. Therefore, we enhanced the efficiency of deep learning methods. PSPD is involved in any known pathway and then follow the association in different stages of the pathway hierarchy with other proteins. Our proposed method could identify 2D-CNN-PSPD with 0.83% sensitivity, 0.92% specificity, 87.27% accuracy, and 0.75% accuracy. We provided an important method for the analysis of PSPD proteins in the proposed research, and our achievements might promote computational biological research. We concluded our proposed model architecture in the future, the use of the latest features, and the multi-one structure to predict different types of molecules, such as DNA, RNA, and disease-pathway specific proteins associations. |
topic |
Molecular structure prediction deep learning convolutional neural network deep learning evolutionary knowledge multiple features |
url |
https://ieeexplore.ieee.org/document/9210038/ |
work_keys_str_mv |
AT alighualm identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork AT xiujuanlei identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork AT yuchenzhang identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork AT shicheng identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork AT minguo identificationofpathwayspecificproteindomainbyincorporatinghyperparameteroptimizationbasedon2dconvolutionalneuralnetwork |
_version_ |
1724183694800846848 |