RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes

RNA plays an important role in many biological processes, and RNA functions are primarily achieved by binding with a variety of proteins. But with the increasing complexity of RPIs networks, high-throughput biological techniques are usually expensive and time consuming. Therefore, there is an urgent...

Full description

Bibliographic Details
Main Authors: Ruibo Gao, Tianhua Yang, Yaxue Shen, Yifan Rong, Kang Ye, Junlan Nie
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9224615/
id doaj-a0e48bc422844323b6bfafdd320e68bb
record_format Article
spelling doaj-a0e48bc422844323b6bfafdd320e68bb2021-03-30T04:10:22ZengIEEEIEEE Access2169-35362020-01-01818986918987710.1109/ACCESS.2020.30313019224615RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features CodesRuibo Gao0https://orcid.org/0000-0001-7931-2899Tianhua Yang1https://orcid.org/0000-0001-5769-1146Yaxue Shen2https://orcid.org/0000-0002-5221-5199Yifan Rong3https://orcid.org/0000-0002-4198-7863Kang Ye4https://orcid.org/0000-0003-0173-7621Junlan Nie5School of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaSchool of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaSchool of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaSchool of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaSchool of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaSchool of Information Science and Engineering, Yanshan University, Qinhuangdao, ChinaRNA plays an important role in many biological processes, and RNA functions are primarily achieved by binding with a variety of proteins. But with the increasing complexity of RPIs networks, high-throughput biological techniques are usually expensive and time consuming. Therefore, there is an urgent need for high speed and reliably computational methods to predict RNA-protein interactions. In this study, we propose a hybrid deep learning model: RPI-MCNNBLSTM, which combines three convolutional neural networks (CNN) with a BLSTM network, to predict RNA-protein interactions using many-sided biological information including protein sequences, RNA sequence and structure. Firstly, we adopt a filling method to pad sequence and structure into equal length, and perform numerical encoding for the sequence and structure of the above equal length, respectively, which are appropriate for subsequent convolution operations. Secondly, we establish the three CNNs to learn the three biological information, separately, then use the BLSTM to capture the long range dependencies among the three features identified by the CNNs. The learned weighted representations are fed into a classification layer to predict ncRNA-protein interactions. Finally, the experimental results indicate that the proposed method achieves superior performance with the accuracy of 98.37% on the RPI1807 dataset, 92.99% on the RPI2241 dataset, 95.47% on the RPI369 dataset, 90.0% on the RPI448 and 87.4% on the RPI1446 dataset, respectively. The code of RPI-MCNNBLSTM and the datasets used in this work are available at https://github.com/xiaopang136/RPI for academic users.https://ieeexplore.ieee.org/document/9224615/Multiple convolutional neural networkLSTM networksmultiple biometric features codesRNA-protein interactions
collection DOAJ
language English
format Article
sources DOAJ
author Ruibo Gao
Tianhua Yang
Yaxue Shen
Yifan Rong
Kang Ye
Junlan Nie
spellingShingle Ruibo Gao
Tianhua Yang
Yaxue Shen
Yifan Rong
Kang Ye
Junlan Nie
RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
IEEE Access
Multiple convolutional neural network
LSTM networks
multiple biometric features codes
RNA-protein interactions
author_facet Ruibo Gao
Tianhua Yang
Yaxue Shen
Yifan Rong
Kang Ye
Junlan Nie
author_sort Ruibo Gao
title RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
title_short RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
title_full RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
title_fullStr RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
title_full_unstemmed RPI-MCNNBLSTM: BLSTM Networks Combining With Multiple Convolutional Neural Network Models to Predict RNA-Protein Interactions Using Multiple Biometric Features Codes
title_sort rpi-mcnnblstm: blstm networks combining with multiple convolutional neural network models to predict rna-protein interactions using multiple biometric features codes
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description RNA plays an important role in many biological processes, and RNA functions are primarily achieved by binding with a variety of proteins. But with the increasing complexity of RPIs networks, high-throughput biological techniques are usually expensive and time consuming. Therefore, there is an urgent need for high speed and reliably computational methods to predict RNA-protein interactions. In this study, we propose a hybrid deep learning model: RPI-MCNNBLSTM, which combines three convolutional neural networks (CNN) with a BLSTM network, to predict RNA-protein interactions using many-sided biological information including protein sequences, RNA sequence and structure. Firstly, we adopt a filling method to pad sequence and structure into equal length, and perform numerical encoding for the sequence and structure of the above equal length, respectively, which are appropriate for subsequent convolution operations. Secondly, we establish the three CNNs to learn the three biological information, separately, then use the BLSTM to capture the long range dependencies among the three features identified by the CNNs. The learned weighted representations are fed into a classification layer to predict ncRNA-protein interactions. Finally, the experimental results indicate that the proposed method achieves superior performance with the accuracy of 98.37% on the RPI1807 dataset, 92.99% on the RPI2241 dataset, 95.47% on the RPI369 dataset, 90.0% on the RPI448 and 87.4% on the RPI1446 dataset, respectively. The code of RPI-MCNNBLSTM and the datasets used in this work are available at https://github.com/xiaopang136/RPI for academic users.
topic Multiple convolutional neural network
LSTM networks
multiple biometric features codes
RNA-protein interactions
url https://ieeexplore.ieee.org/document/9224615/
work_keys_str_mv AT ruibogao rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
AT tianhuayang rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
AT yaxueshen rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
AT yifanrong rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
AT kangye rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
AT junlannie rpimcnnblstmblstmnetworkscombiningwithmultipleconvolutionalneuralnetworkmodelstopredictrnaproteininteractionsusingmultiplebiometricfeaturescodes
_version_ 1724182222909472768