L γ -PageRank for semi-supervised learning

Abstract PageRank for Semi-Supervised Learning has shown to leverage data structures and limited tagged examples to yield meaningful classification. Despite successes, classification performance can still be improved, particularly in cases of graphs with unclear clusters or unbalanced labeled data....

Full description

Bibliographic Details
Main Authors: Esteban Bautista, Patrice Abry, Paulo Gonçalves
Format: Article
Language:English
Published: SpringerOpen 2019-08-01
Series:Applied Network Science
Subjects:
Online Access:http://link.springer.com/article/10.1007/s41109-019-0172-x
id doaj-bcd5318d2f4546e1a2643b9d59198492
record_format Article
spelling doaj-bcd5318d2f4546e1a2643b9d591984922020-11-25T03:49:38ZengSpringerOpenApplied Network Science2364-82282019-08-014112010.1007/s41109-019-0172-xL γ -PageRank for semi-supervised learningEsteban Bautista0Patrice Abry1Paulo Gonçalves2Univ Lyon, Inria, CNRS, ENS de Lyon, UCB Lyon 1, LIP UMR 5668Univ Lyon, Ens de Lyon, Univ Claude Bernard, CNRS, Laboratoire de PhysiqueUniv Lyon, Inria, CNRS, ENS de Lyon, UCB Lyon 1, LIP UMR 5668Abstract PageRank for Semi-Supervised Learning has shown to leverage data structures and limited tagged examples to yield meaningful classification. Despite successes, classification performance can still be improved, particularly in cases of graphs with unclear clusters or unbalanced labeled data. To address such limitations, a novel approach based on powers of the Laplacian matrix L γ (γ>0), referred to as L γ -PageRank, is proposed. Its theoretical study shows that it operates on signed graphs, where nodes belonging to one same class are more likely to share positive edges while nodes from different classes are more likely to be connected with negative edges. It is shown that by selecting an optimal γ, classification performance can be significantly enhanced. A procedure for the automated estimation of the optimal γ, from a unique observation of data, is devised and assessed. Experiments on several datasets demonstrate the effectiveness of both L γ -PageRank classification and the optimal γ estimation.http://link.springer.com/article/10.1007/s41109-019-0172-xSemi-supervised learningPageRankLaplacian powersDiffusion on graphsSigned graphsOptimal tuning
collection DOAJ
language English
format Article
sources DOAJ
author Esteban Bautista
Patrice Abry
Paulo Gonçalves
spellingShingle Esteban Bautista
Patrice Abry
Paulo Gonçalves
L γ -PageRank for semi-supervised learning
Applied Network Science
Semi-supervised learning
PageRank
Laplacian powers
Diffusion on graphs
Signed graphs
Optimal tuning
author_facet Esteban Bautista
Patrice Abry
Paulo Gonçalves
author_sort Esteban Bautista
title L γ -PageRank for semi-supervised learning
title_short L γ -PageRank for semi-supervised learning
title_full L γ -PageRank for semi-supervised learning
title_fullStr L γ -PageRank for semi-supervised learning
title_full_unstemmed L γ -PageRank for semi-supervised learning
title_sort l γ -pagerank for semi-supervised learning
publisher SpringerOpen
series Applied Network Science
issn 2364-8228
publishDate 2019-08-01
description Abstract PageRank for Semi-Supervised Learning has shown to leverage data structures and limited tagged examples to yield meaningful classification. Despite successes, classification performance can still be improved, particularly in cases of graphs with unclear clusters or unbalanced labeled data. To address such limitations, a novel approach based on powers of the Laplacian matrix L γ (γ>0), referred to as L γ -PageRank, is proposed. Its theoretical study shows that it operates on signed graphs, where nodes belonging to one same class are more likely to share positive edges while nodes from different classes are more likely to be connected with negative edges. It is shown that by selecting an optimal γ, classification performance can be significantly enhanced. A procedure for the automated estimation of the optimal γ, from a unique observation of data, is devised and assessed. Experiments on several datasets demonstrate the effectiveness of both L γ -PageRank classification and the optimal γ estimation.
topic Semi-supervised learning
PageRank
Laplacian powers
Diffusion on graphs
Signed graphs
Optimal tuning
url http://link.springer.com/article/10.1007/s41109-019-0172-x
work_keys_str_mv AT estebanbautista lgpagerankforsemisupervisedlearning
AT patriceabry lgpagerankforsemisupervisedlearning
AT paulogoncalves lgpagerankforsemisupervisedlearning
_version_ 1724494310455377920