edge2vec: Representation learning using edge semantics for biomedical knowledge discovery

Abstract Background Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is ex...

Full description

Bibliographic Details
Main Authors:	Zheng Gao, Gang Fu, Chunping Ouyang, Satoshi Tsutsui, Xiaozhong Liu, Jeremy Yang, Christopher Gessner, Brian Foote, David Wild, Ying Ding, Qi Yu
Format:	Article
Language:	English
Published:	BMC 2019-06-01
Series:	BMC Bioinformatics
Subjects:	Knowledge graph Heterogeneous network Biomedical knowledge discovery Representation learning Graph embedding Node embedding
Online Access:	http://link.springer.com/article/10.1186/s12859-019-2914-2

id	doaj-072b4e415f14457ebcfb4d03f84e5a82
record_format	Article
spelling	doaj-072b4e415f14457ebcfb4d03f84e5a822020-11-25T02:26:48ZengBMCBMC Bioinformatics1471-21052019-06-0120111510.1186/s12859-019-2914-2edge2vec: Representation learning using edge semantics for biomedical knowledge discoveryZheng Gao0Gang Fu1Chunping Ouyang2Satoshi Tsutsui3Xiaozhong Liu4Jeremy Yang5Christopher Gessner6Brian Foote7David Wild8Ying Ding9Qi Yu10School of Informatics, Computing and Engineering, Indiana UniversityMicrosoft CorporationUniversity of South ChinaSchool of Informatics, Computing and Engineering, Indiana UniversitySchool of Informatics, Computing and Engineering, Indiana UniversitySchool of Informatics, Computing and Engineering, Indiana UniversitySchool of Informatics, Computing and Engineering, Indiana UniversityData2Discovery, Inc.School of Informatics, Computing and Engineering, Indiana UniversitySchool of Informatics, Computing and Engineering, Indiana UniversitySchool of Management, Shanxi Medical UniversityAbstract Background Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is extending this methodology for richly heterogeneous graphs and knowledge domains. The biomedical sciences are such a domain, reflecting the complexity of biology, with entities such as genes, proteins, drugs, diseases, and phenotypes, and relationships such as gene co-expression, biochemical regulation, and biomolecular inhibition or activation. Therefore, the semantics of edges and nodes are critical for representation learning and knowledge discovery in real world biomedical problems. Results In this paper, we propose the edge2vec model, which represents graphs considering edge semantics. An edge-type transition matrix is trained by an Expectation-Maximization approach, and a stochastic gradient descent model is employed to learn node embedding on a heterogeneous graph via the trained transition matrix. edge2vec is validated on three biomedical domain tasks: biomedical entity classification, compound-gene bioactivity prediction, and biomedical information retrieval. Results show that by considering edge-types into node embedding learning in heterogeneous graphs, edge2vec significantly outperforms state-of-the-art models on all three tasks. Conclusions We propose this method for its added value relative to existing graph analytical methodology, and in the real world context of biomedical knowledge discovery applicability.http://link.springer.com/article/10.1186/s12859-019-2914-2Knowledge graphHeterogeneous networkBiomedical knowledge discoveryRepresentation learningGraph embeddingNode embedding
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zheng Gao Gang Fu Chunping Ouyang Satoshi Tsutsui Xiaozhong Liu Jeremy Yang Christopher Gessner Brian Foote David Wild Ying Ding Qi Yu
spellingShingle	Zheng Gao Gang Fu Chunping Ouyang Satoshi Tsutsui Xiaozhong Liu Jeremy Yang Christopher Gessner Brian Foote David Wild Ying Ding Qi Yu edge2vec: Representation learning using edge semantics for biomedical knowledge discovery BMC Bioinformatics Knowledge graph Heterogeneous network Biomedical knowledge discovery Representation learning Graph embedding Node embedding
author_facet	Zheng Gao Gang Fu Chunping Ouyang Satoshi Tsutsui Xiaozhong Liu Jeremy Yang Christopher Gessner Brian Foote David Wild Ying Ding Qi Yu
author_sort	Zheng Gao
title	edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
title_short	edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
title_full	edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
title_fullStr	edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
title_full_unstemmed	edge2vec: Representation learning using edge semantics for biomedical knowledge discovery
title_sort	edge2vec: representation learning using edge semantics for biomedical knowledge discovery
publisher	BMC
series	BMC Bioinformatics
issn	1471-2105
publishDate	2019-06-01
description	Abstract Background Representation learning provides new and powerful graph analytical approaches and tools for the highly valued data science challenge of mining knowledge graphs. Since previous graph analytical methods have mostly focused on homogeneous graphs, an important current challenge is extending this methodology for richly heterogeneous graphs and knowledge domains. The biomedical sciences are such a domain, reflecting the complexity of biology, with entities such as genes, proteins, drugs, diseases, and phenotypes, and relationships such as gene co-expression, biochemical regulation, and biomolecular inhibition or activation. Therefore, the semantics of edges and nodes are critical for representation learning and knowledge discovery in real world biomedical problems. Results In this paper, we propose the edge2vec model, which represents graphs considering edge semantics. An edge-type transition matrix is trained by an Expectation-Maximization approach, and a stochastic gradient descent model is employed to learn node embedding on a heterogeneous graph via the trained transition matrix. edge2vec is validated on three biomedical domain tasks: biomedical entity classification, compound-gene bioactivity prediction, and biomedical information retrieval. Results show that by considering edge-types into node embedding learning in heterogeneous graphs, edge2vec significantly outperforms state-of-the-art models on all three tasks. Conclusions We propose this method for its added value relative to existing graph analytical methodology, and in the real world context of biomedical knowledge discovery applicability.
topic	Knowledge graph Heterogeneous network Biomedical knowledge discovery Representation learning Graph embedding Node embedding
url	http://link.springer.com/article/10.1186/s12859-019-2914-2
work_keys_str_mv	AT zhenggao edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT gangfu edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT chunpingouyang edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT satoshitsutsui edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT xiaozhongliu edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT jeremyyang edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT christophergessner edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT brianfoote edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT davidwild edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT yingding edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery AT qiyu edge2vecrepresentationlearningusingedgesemanticsforbiomedicalknowledgediscovery
_version_	1724845576254652416

edge2vec: Representation learning using edge semantics for biomedical knowledge discovery

Similar Items