Completing sparse and disconnected protein-protein network by deep learning
Abstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2018-03-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s12859-018-2112-7 |
id |
doaj-710895d358464d6daa6ab9c2eb42f76a |
---|---|
record_format |
Article |
spelling |
doaj-710895d358464d6daa6ab9c2eb42f76a2020-11-24T22:01:43ZengBMCBMC Bioinformatics1471-21052018-03-0119111210.1186/s12859-018-2112-7Completing sparse and disconnected protein-protein network by deep learningLei Huang0Li Liao1Cathy H. Wu2Department of Computer and Information Sciences, University of DelawareDepartment of Computer and Information Sciences, University of DelawareDepartment of Computer and Information Sciences, University of DelawareAbstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. Results In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network and multiple heterogeneous data sources. Tested by the yeast data with six heterogeneous feature kernels, the results show our method can further improve the prediction performance by up to 2%, which is very close to an upper bound that is obtained by an Approximate Bayesian Computation based sampling method. Conclusions The proposed evolution deep neural network, coupled with regularized Laplacian kernel, is an effective tool in completing sparse and disconnected PPI networks and in facilitating integration of heterogeneous data sources.http://link.springer.com/article/10.1186/s12859-018-2112-7Disconnected protein interaction networkNeural networkInteraction predictionNetwork evolutionRegularized Laplacian |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lei Huang Li Liao Cathy H. Wu |
spellingShingle |
Lei Huang Li Liao Cathy H. Wu Completing sparse and disconnected protein-protein network by deep learning BMC Bioinformatics Disconnected protein interaction network Neural network Interaction prediction Network evolution Regularized Laplacian |
author_facet |
Lei Huang Li Liao Cathy H. Wu |
author_sort |
Lei Huang |
title |
Completing sparse and disconnected protein-protein network by deep learning |
title_short |
Completing sparse and disconnected protein-protein network by deep learning |
title_full |
Completing sparse and disconnected protein-protein network by deep learning |
title_fullStr |
Completing sparse and disconnected protein-protein network by deep learning |
title_full_unstemmed |
Completing sparse and disconnected protein-protein network by deep learning |
title_sort |
completing sparse and disconnected protein-protein network by deep learning |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2018-03-01 |
description |
Abstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. Results In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network and multiple heterogeneous data sources. Tested by the yeast data with six heterogeneous feature kernels, the results show our method can further improve the prediction performance by up to 2%, which is very close to an upper bound that is obtained by an Approximate Bayesian Computation based sampling method. Conclusions The proposed evolution deep neural network, coupled with regularized Laplacian kernel, is an effective tool in completing sparse and disconnected PPI networks and in facilitating integration of heterogeneous data sources. |
topic |
Disconnected protein interaction network Neural network Interaction prediction Network evolution Regularized Laplacian |
url |
http://link.springer.com/article/10.1186/s12859-018-2112-7 |
work_keys_str_mv |
AT leihuang completingsparseanddisconnectedproteinproteinnetworkbydeeplearning AT liliao completingsparseanddisconnectedproteinproteinnetworkbydeeplearning AT cathyhwu completingsparseanddisconnectedproteinproteinnetworkbydeeplearning |
_version_ |
1725838903815962624 |