A novel method to predict essential proteins based on tensor and HITS algorithm

Abstract Background Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus ma...

Full description

Bibliographic Details
Main Authors: Zhihong Zhang, Yingchun Luo, Sai Hu, Xueyong Li, Lei Wang, Bihai Zhao
Format: Article
Language:English
Published: BMC 2020-04-01
Series:Human Genomics
Online Access:http://link.springer.com/article/10.1186/s40246-020-00263-7
id doaj-3624fbb1b9dc42ac996011bed3a712e4
record_format Article
spelling doaj-3624fbb1b9dc42ac996011bed3a712e42020-11-25T01:47:55ZengBMCHuman Genomics1479-73642020-04-0114111210.1186/s40246-020-00263-7A novel method to predict essential proteins based on tensor and HITS algorithmZhihong Zhang0Yingchun Luo1Sai Hu2Xueyong Li3Lei Wang4Bihai Zhao5College of Computer Engineering and Applied Mathematics, Changsha UniversityCollege of Computer Engineering and Applied Mathematics, Changsha UniversityCollege of Computer Engineering and Applied Mathematics, Changsha UniversityCollege of Computer Engineering and Applied Mathematics, Changsha UniversityCollege of Computer Engineering and Applied Mathematics, Changsha UniversityCollege of Computer Engineering and Applied Mathematics, Changsha UniversityAbstract Background Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus mainly on the topological structure of PPI networks. However, those methods relying solely on the PPI network have low detection accuracy for essential proteins. Therefore, it is necessary to integrate the PPI network with other biological information to identify essential proteins. Results In this paper, we proposed a novel random walk method for identifying essential proteins, called HEPT. A three-dimensional tensor is constructed first by combining the PPI network of Saccharomyces cerevisiae with multiple biological data such as gene ontology annotations and protein domains. Then, based on the newly constructed tensor, we extended the Hyperlink-Induced Topic Search (HITS) algorithm from a two-dimensional to a three-dimensional tensor model that can be utilized to infer essential proteins. Different from existing state-of-the-art methods, the importance of proteins and the types of interactions will both contribute to the essential protein prediction. To evaluate the performance of our newly proposed HEPT method, proteins are ranked in the descending order based on their ranking scores computed by our method and other competitive methods. After that, a certain number of the ranked proteins are selected as candidates for essential proteins. According to the list of known essential proteins, the number of true essential proteins is used to judge the performance of each method. Experimental results show that our method can achieve better prediction performance in comparison with other nine state-of-the-art methods in identifying essential proteins. Conclusions Through analysis and experimental results, it is obvious that HEPT can be used to effectively improve the prediction accuracy of essential proteins by the use of HITS algorithm and the combination of network topology with gene ontology annotations and protein domains, which provides a new insight into multi-data source fusion.http://link.springer.com/article/10.1186/s40246-020-00263-7
collection DOAJ
language English
format Article
sources DOAJ
author Zhihong Zhang
Yingchun Luo
Sai Hu
Xueyong Li
Lei Wang
Bihai Zhao
spellingShingle Zhihong Zhang
Yingchun Luo
Sai Hu
Xueyong Li
Lei Wang
Bihai Zhao
A novel method to predict essential proteins based on tensor and HITS algorithm
Human Genomics
author_facet Zhihong Zhang
Yingchun Luo
Sai Hu
Xueyong Li
Lei Wang
Bihai Zhao
author_sort Zhihong Zhang
title A novel method to predict essential proteins based on tensor and HITS algorithm
title_short A novel method to predict essential proteins based on tensor and HITS algorithm
title_full A novel method to predict essential proteins based on tensor and HITS algorithm
title_fullStr A novel method to predict essential proteins based on tensor and HITS algorithm
title_full_unstemmed A novel method to predict essential proteins based on tensor and HITS algorithm
title_sort novel method to predict essential proteins based on tensor and hits algorithm
publisher BMC
series Human Genomics
issn 1479-7364
publishDate 2020-04-01
description Abstract Background Essential proteins are an important part of the cell and closely related to the life activities of the cell. Hitherto, Protein-Protein Interaction (PPI) networks have been adopted by many computational methods to predict essential proteins. Most of the current approaches focus mainly on the topological structure of PPI networks. However, those methods relying solely on the PPI network have low detection accuracy for essential proteins. Therefore, it is necessary to integrate the PPI network with other biological information to identify essential proteins. Results In this paper, we proposed a novel random walk method for identifying essential proteins, called HEPT. A three-dimensional tensor is constructed first by combining the PPI network of Saccharomyces cerevisiae with multiple biological data such as gene ontology annotations and protein domains. Then, based on the newly constructed tensor, we extended the Hyperlink-Induced Topic Search (HITS) algorithm from a two-dimensional to a three-dimensional tensor model that can be utilized to infer essential proteins. Different from existing state-of-the-art methods, the importance of proteins and the types of interactions will both contribute to the essential protein prediction. To evaluate the performance of our newly proposed HEPT method, proteins are ranked in the descending order based on their ranking scores computed by our method and other competitive methods. After that, a certain number of the ranked proteins are selected as candidates for essential proteins. According to the list of known essential proteins, the number of true essential proteins is used to judge the performance of each method. Experimental results show that our method can achieve better prediction performance in comparison with other nine state-of-the-art methods in identifying essential proteins. Conclusions Through analysis and experimental results, it is obvious that HEPT can be used to effectively improve the prediction accuracy of essential proteins by the use of HITS algorithm and the combination of network topology with gene ontology annotations and protein domains, which provides a new insight into multi-data source fusion.
url http://link.springer.com/article/10.1186/s40246-020-00263-7
work_keys_str_mv AT zhihongzhang anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT yingchunluo anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT saihu anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT xueyongli anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT leiwang anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT bihaizhao anovelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT zhihongzhang novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT yingchunluo novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT saihu novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT xueyongli novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT leiwang novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
AT bihaizhao novelmethodtopredictessentialproteinsbasedontensorandhitsalgorithm
_version_ 1725013931429199872