Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction

Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many...

Full description

Bibliographic Details
Main Authors: Ming He, Chen Huang, Bo Liu, Yadong Wang, Junyi Li
Format: Article
Language:English
Published: BMC 2021-03-01
Series:BMC Bioinformatics
Subjects:
Online Access:https://doi.org/10.1186/s12859-021-04099-3
id doaj-061b531992954a999a5977506925ad9d
record_format Article
spelling doaj-061b531992954a999a5977506925ad9d2021-04-04T11:45:31ZengBMCBMC Bioinformatics1471-21052021-03-0122111510.1186/s12859-021-04099-3Factor graph-aggregated heterogeneous network embedding for disease-gene association predictionMing He0Chen Huang1Bo Liu2Yadong Wang3Junyi Li4School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of TechnologySchool of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many existing methods cannot fully utilize the multi-dimensional biological entity relationship to predict disease-gene association due to multi-source heterogeneous data. This paper proposes FactorHNE, a factor graph-aggregated heterogeneous network embedding method for disease-gene association prediction, which captures a variety of semantic relationships between the heterogeneous nodes by factorization. It produces different semantic factor graphs and effectively aggregates a variety of semantic relationships, by using end-to-end multi-perspectives loss function to optimize model. Then it produces good nodes embedding to prediction disease-gene association. Conclusions Experimental verification and analysis show FactorHNE has better performance and scalability than the existing models. It also has good interpretability and can be extended to large-scale biomedical network data analysis.https://doi.org/10.1186/s12859-021-04099-3Disease-gene association predictionHeterogeneous networkGraph neural networkFactorization
collection DOAJ
language English
format Article
sources DOAJ
author Ming He
Chen Huang
Bo Liu
Yadong Wang
Junyi Li
spellingShingle Ming He
Chen Huang
Bo Liu
Yadong Wang
Junyi Li
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
BMC Bioinformatics
Disease-gene association prediction
Heterogeneous network
Graph neural network
Factorization
author_facet Ming He
Chen Huang
Bo Liu
Yadong Wang
Junyi Li
author_sort Ming He
title Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
title_short Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
title_full Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
title_fullStr Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
title_full_unstemmed Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
title_sort factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
publisher BMC
series BMC Bioinformatics
issn 1471-2105
publishDate 2021-03-01
description Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many existing methods cannot fully utilize the multi-dimensional biological entity relationship to predict disease-gene association due to multi-source heterogeneous data. This paper proposes FactorHNE, a factor graph-aggregated heterogeneous network embedding method for disease-gene association prediction, which captures a variety of semantic relationships between the heterogeneous nodes by factorization. It produces different semantic factor graphs and effectively aggregates a variety of semantic relationships, by using end-to-end multi-perspectives loss function to optimize model. Then it produces good nodes embedding to prediction disease-gene association. Conclusions Experimental verification and analysis show FactorHNE has better performance and scalability than the existing models. It also has good interpretability and can be extended to large-scale biomedical network data analysis.
topic Disease-gene association prediction
Heterogeneous network
Graph neural network
Factorization
url https://doi.org/10.1186/s12859-021-04099-3
work_keys_str_mv AT minghe factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction
AT chenhuang factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction
AT boliu factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction
AT yadongwang factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction
AT junyili factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction
_version_ 1721542335189221376