Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction
Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-03-01
|
Series: | BMC Bioinformatics |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12859-021-04099-3 |
id |
doaj-061b531992954a999a5977506925ad9d |
---|---|
record_format |
Article |
spelling |
doaj-061b531992954a999a5977506925ad9d2021-04-04T11:45:31ZengBMCBMC Bioinformatics1471-21052021-03-0122111510.1186/s12859-021-04099-3Factor graph-aggregated heterogeneous network embedding for disease-gene association predictionMing He0Chen Huang1Bo Liu2Yadong Wang3Junyi Li4School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)Center for Bioinformatics, School of Computer Science and Technology, Harbin Institute of TechnologySchool of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen)Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many existing methods cannot fully utilize the multi-dimensional biological entity relationship to predict disease-gene association due to multi-source heterogeneous data. This paper proposes FactorHNE, a factor graph-aggregated heterogeneous network embedding method for disease-gene association prediction, which captures a variety of semantic relationships between the heterogeneous nodes by factorization. It produces different semantic factor graphs and effectively aggregates a variety of semantic relationships, by using end-to-end multi-perspectives loss function to optimize model. Then it produces good nodes embedding to prediction disease-gene association. Conclusions Experimental verification and analysis show FactorHNE has better performance and scalability than the existing models. It also has good interpretability and can be extended to large-scale biomedical network data analysis.https://doi.org/10.1186/s12859-021-04099-3Disease-gene association predictionHeterogeneous networkGraph neural networkFactorization |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ming He Chen Huang Bo Liu Yadong Wang Junyi Li |
spellingShingle |
Ming He Chen Huang Bo Liu Yadong Wang Junyi Li Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction BMC Bioinformatics Disease-gene association prediction Heterogeneous network Graph neural network Factorization |
author_facet |
Ming He Chen Huang Bo Liu Yadong Wang Junyi Li |
author_sort |
Ming He |
title |
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
title_short |
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
title_full |
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
title_fullStr |
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
title_full_unstemmed |
Factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
title_sort |
factor graph-aggregated heterogeneous network embedding for disease-gene association prediction |
publisher |
BMC |
series |
BMC Bioinformatics |
issn |
1471-2105 |
publishDate |
2021-03-01 |
description |
Abstract Background Exploring the relationship between disease and gene is of great significance for understanding the pathogenesis of disease and developing corresponding therapeutic measures. The prediction of disease-gene association by computational methods accelerates the process. Results Many existing methods cannot fully utilize the multi-dimensional biological entity relationship to predict disease-gene association due to multi-source heterogeneous data. This paper proposes FactorHNE, a factor graph-aggregated heterogeneous network embedding method for disease-gene association prediction, which captures a variety of semantic relationships between the heterogeneous nodes by factorization. It produces different semantic factor graphs and effectively aggregates a variety of semantic relationships, by using end-to-end multi-perspectives loss function to optimize model. Then it produces good nodes embedding to prediction disease-gene association. Conclusions Experimental verification and analysis show FactorHNE has better performance and scalability than the existing models. It also has good interpretability and can be extended to large-scale biomedical network data analysis. |
topic |
Disease-gene association prediction Heterogeneous network Graph neural network Factorization |
url |
https://doi.org/10.1186/s12859-021-04099-3 |
work_keys_str_mv |
AT minghe factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction AT chenhuang factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction AT boliu factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction AT yadongwang factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction AT junyili factorgraphaggregatedheterogeneousnetworkembeddingfordiseasegeneassociationprediction |
_version_ |
1721542335189221376 |