An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks

Network representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Due to the expensive costs of obtaining label information of nodes in networks, many unsupervised network representation learning methods have been proposed, wh...

Full description

Bibliographic Details
Main Authors: Xin Xu, Yang Lu, Yupeng Zhou, Zhiguo Fu, Yanjie Fu, Minghao Yin
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/9/15/1767
id doaj-258b7f4062d24818861770a771ad667c
record_format Article
spelling doaj-258b7f4062d24818861770a771ad667c2021-08-06T15:28:21ZengMDPI AGMathematics2227-73902021-07-0191767176710.3390/math9151767An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification TasksXin Xu0Yang Lu1Yupeng Zhou2Zhiguo Fu3Yanjie Fu4Minghao Yin5Department of Computer Science, College of Information Science and Technology, Northeast Normal University, Changchun 130117, ChinaDepartment of Computer Science, College of Information Science and Technology, Northeast Normal University, Changchun 130117, ChinaDepartment of Computer Science, College of Information Science and Technology, Northeast Normal University, Changchun 130117, ChinaDepartment of Computer Science, College of Information Science and Technology, Northeast Normal University, Changchun 130117, ChinaDepartment of Computer Science, College of Engineering and Computer Science, University of Central Florida, Orlando, FL 32816, USADepartment of Computer Science, College of Information Science and Technology, Northeast Normal University, Changchun 130117, ChinaNetwork representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Due to the expensive costs of obtaining label information of nodes in networks, many unsupervised network representation learning methods have been proposed, where random walk strategy is one of the wildly utilized approaches. However, the existing random walk based methods have some challenges, including: 1. The insufficiency of explaining what network knowledge in the walking path-samplings; 2. The adverse effects caused by the mixture of different information in networks; 3. The poor generality of the methods with hyper-parameters on different networks. This paper proposes an information-explainable random walk based unsupervised network representation learning framework named Probabilistic Accepted Walk (PAW) to obtain network representation from the perspective of the stationary distribution of networks. In the framework, we design two stationary distributions based on nodes’ self-information and local-information of networks to guide our proposed random walk strategy to learn representational vectors of networks through sampling paths of nodes. Numerous experimental results demonstrated that the PAW could obtain more expressive representation than the other six widely used unsupervised network representation learning baselines on four real-world networks in single-label and multi-label node classification tasks.https://www.mdpi.com/2227-7390/9/15/1767network representation learningrandom walkstationary distributionsunsupervised learningnetwork embedding
collection DOAJ
language English
format Article
sources DOAJ
author Xin Xu
Yang Lu
Yupeng Zhou
Zhiguo Fu
Yanjie Fu
Minghao Yin
spellingShingle Xin Xu
Yang Lu
Yupeng Zhou
Zhiguo Fu
Yanjie Fu
Minghao Yin
An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
Mathematics
network representation learning
random walk
stationary distributions
unsupervised learning
network embedding
author_facet Xin Xu
Yang Lu
Yupeng Zhou
Zhiguo Fu
Yanjie Fu
Minghao Yin
author_sort Xin Xu
title An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
title_short An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
title_full An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
title_fullStr An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
title_full_unstemmed An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks
title_sort information-explainable random walk based unsupervised network representation learning framework on node classification tasks
publisher MDPI AG
series Mathematics
issn 2227-7390
publishDate 2021-07-01
description Network representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Due to the expensive costs of obtaining label information of nodes in networks, many unsupervised network representation learning methods have been proposed, where random walk strategy is one of the wildly utilized approaches. However, the existing random walk based methods have some challenges, including: 1. The insufficiency of explaining what network knowledge in the walking path-samplings; 2. The adverse effects caused by the mixture of different information in networks; 3. The poor generality of the methods with hyper-parameters on different networks. This paper proposes an information-explainable random walk based unsupervised network representation learning framework named Probabilistic Accepted Walk (PAW) to obtain network representation from the perspective of the stationary distribution of networks. In the framework, we design two stationary distributions based on nodes’ self-information and local-information of networks to guide our proposed random walk strategy to learn representational vectors of networks through sampling paths of nodes. Numerous experimental results demonstrated that the PAW could obtain more expressive representation than the other six widely used unsupervised network representation learning baselines on four real-world networks in single-label and multi-label node classification tasks.
topic network representation learning
random walk
stationary distributions
unsupervised learning
network embedding
url https://www.mdpi.com/2227-7390/9/15/1767
work_keys_str_mv AT xinxu aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yanglu aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yupengzhou aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT zhiguofu aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yanjiefu aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT minghaoyin aninformationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT xinxu informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yanglu informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yupengzhou informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT zhiguofu informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT yanjiefu informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
AT minghaoyin informationexplainablerandomwalkbasedunsupervisednetworkrepresentationlearningframeworkonnodeclassificationtasks
_version_ 1721217894472220672