Completing sparse and disconnected protein-protein network by deep learning

Abstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network...

Full description

Bibliographic Details
Main Authors: Lei Huang, Li Liao, Cathy H. Wu
Format: Article
Language:English
Published: BMC 2018-03-01
Series:BMC Bioinformatics
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12859-018-2112-7
id doaj-710895d358464d6daa6ab9c2eb42f76a
record_format Article
spelling doaj-710895d358464d6daa6ab9c2eb42f76a2020-11-24T22:01:43ZengBMCBMC Bioinformatics1471-21052018-03-0119111210.1186/s12859-018-2112-7Completing sparse and disconnected protein-protein network by deep learningLei Huang0Li Liao1Cathy H. Wu2Department of Computer and Information Sciences, University of DelawareDepartment of Computer and Information Sciences, University of DelawareDepartment of Computer and Information Sciences, University of DelawareAbstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. Results In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network and multiple heterogeneous data sources. Tested by the yeast data with six heterogeneous feature kernels, the results show our method can further improve the prediction performance by up to 2%, which is very close to an upper bound that is obtained by an Approximate Bayesian Computation based sampling method. Conclusions The proposed evolution deep neural network, coupled with regularized Laplacian kernel, is an effective tool in completing sparse and disconnected PPI networks and in facilitating integration of heterogeneous data sources.http://link.springer.com/article/10.1186/s12859-018-2112-7Disconnected protein interaction networkNeural networkInteraction predictionNetwork evolutionRegularized Laplacian
collection DOAJ
language English
format Article
sources DOAJ
author Lei Huang
Li Liao
Cathy H. Wu
spellingShingle Lei Huang
Li Liao
Cathy H. Wu
Completing sparse and disconnected protein-protein network by deep learning
BMC Bioinformatics
Disconnected protein interaction network
Neural network
Interaction prediction
Network evolution
Regularized Laplacian
author_facet Lei Huang
Li Liao
Cathy H. Wu
author_sort Lei Huang
title Completing sparse and disconnected protein-protein network by deep learning
title_short Completing sparse and disconnected protein-protein network by deep learning
title_full Completing sparse and disconnected protein-protein network by deep learning
title_fullStr Completing sparse and disconnected protein-protein network by deep learning
title_full_unstemmed Completing sparse and disconnected protein-protein network by deep learning
title_sort completing sparse and disconnected protein-protein network by deep learning
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2018-03-01
description Abstract Background Protein-protein interaction (PPI) prediction remains a central task in systems biology to achieve a better and holistic understanding of cellular and intracellular processes. Recently, an increasing number of computational methods have shifted from pair-wise prediction to network level prediction. Many of the existing network level methods predict PPIs under the assumption that the training network should be connected. However, this assumption greatly affects the prediction power and limits the application area because the current golden standard PPI networks are usually very sparse and disconnected. Therefore, how to effectively predict PPIs based on a training network that is sparse and disconnected remains a challenge. Results In this work, we developed a novel PPI prediction method based on deep learning neural network and regularized Laplacian kernel. We use a neural network with an autoencoder-like architecture to implicitly simulate the evolutionary processes of a PPI network. Neurons of the output layer correspond to proteins and are labeled with values (1 for interaction and 0 for otherwise) from the adjacency matrix of a sparse disconnected training PPI network. Unlike autoencoder, neurons at the input layer are given all zero input, reflecting an assumption of no a priori knowledge about PPIs, and hidden layers of smaller sizes mimic ancient interactome at different times during evolution. After the training step, an evolved PPI network whose rows are outputs of the neural network can be obtained. We then predict PPIs by applying the regularized Laplacian kernel to the transition matrix that is built upon the evolved PPI network. The results from cross-validation experiments show that the PPI prediction accuracies for yeast data and human data measured as AUC are increased by up to 8.4 and 14.9% respectively, as compared to the baseline. Moreover, the evolved PPI network can also help us leverage complementary information from the disconnected training network and multiple heterogeneous data sources. Tested by the yeast data with six heterogeneous feature kernels, the results show our method can further improve the prediction performance by up to 2%, which is very close to an upper bound that is obtained by an Approximate Bayesian Computation based sampling method. Conclusions The proposed evolution deep neural network, coupled with regularized Laplacian kernel, is an effective tool in completing sparse and disconnected PPI networks and in facilitating integration of heterogeneous data sources.
topic Disconnected protein interaction network
Neural network
Interaction prediction
Network evolution
Regularized Laplacian
url http://link.springer.com/article/10.1186/s12859-018-2112-7
work_keys_str_mv AT leihuang completingsparseanddisconnectedproteinproteinnetworkbydeeplearning
AT liliao completingsparseanddisconnectedproteinproteinnetworkbydeeplearning
AT cathyhwu completingsparseanddisconnectedproteinproteinnetworkbydeeplearning
_version_ 1725838903815962624