Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.

Due to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-li...

Full description

Bibliographic Details
Main Authors: Liang Yang, Meng Ge, Di Jin, Dongxiao He, Huazhu Fu, Jing Wang, Xiaochun Cao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2017-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5497956?pdf=render
id doaj-ab06ca605d9245d796fd2bdd71cf90a7
record_format Article
spelling doaj-ab06ca605d9245d796fd2bdd71cf90a72020-11-25T01:45:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01127e017802910.1371/journal.pone.0178029Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.Liang YangMeng GeDi JinDongxiao HeHuazhu FuJing WangXiaochun CaoDue to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-link constraints, which can force the exclusion between nodes. In this paper, we are interested in understanding the role of cannot-link constraints and effectively encoding pairwise constraints. Towards these goals, we define an integral generative process jointly considering the network topology, must-link and cannot-link constraints. We propose to characterize this process as a Multi-variance Mixed Gaussian Generative (MMGG) Model to address diverse degrees of confidences that exist in network topology and pairwise constraints and formulate it as a weighted nonnegative matrix factorization problem. The experiments on artificial and real-world networks not only illustrate the superiority of our proposed MMGG, but also, most importantly, reveal the roles of pairwise constraints. That is, though the must-link is more important than cannot-link when either of them is available, both must-link and cannot-link are equally important when both of them are available. To the best of our knowledge, this is the first work on discovering and exploring the importance of cannot-link constraints in semi-supervised community detection.http://europepmc.org/articles/PMC5497956?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Liang Yang
Meng Ge
Di Jin
Dongxiao He
Huazhu Fu
Jing Wang
Xiaochun Cao
spellingShingle Liang Yang
Meng Ge
Di Jin
Dongxiao He
Huazhu Fu
Jing Wang
Xiaochun Cao
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
PLoS ONE
author_facet Liang Yang
Meng Ge
Di Jin
Dongxiao He
Huazhu Fu
Jing Wang
Xiaochun Cao
author_sort Liang Yang
title Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
title_short Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
title_full Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
title_fullStr Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
title_full_unstemmed Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
title_sort exploring the roles of cannot-link constraint in community detection via multi-variance mixed gaussian generative model.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2017-01-01
description Due to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-link constraints, which can force the exclusion between nodes. In this paper, we are interested in understanding the role of cannot-link constraints and effectively encoding pairwise constraints. Towards these goals, we define an integral generative process jointly considering the network topology, must-link and cannot-link constraints. We propose to characterize this process as a Multi-variance Mixed Gaussian Generative (MMGG) Model to address diverse degrees of confidences that exist in network topology and pairwise constraints and formulate it as a weighted nonnegative matrix factorization problem. The experiments on artificial and real-world networks not only illustrate the superiority of our proposed MMGG, but also, most importantly, reveal the roles of pairwise constraints. That is, though the must-link is more important than cannot-link when either of them is available, both must-link and cannot-link are equally important when both of them are available. To the best of our knowledge, this is the first work on discovering and exploring the importance of cannot-link constraints in semi-supervised community detection.
url http://europepmc.org/articles/PMC5497956?pdf=render
work_keys_str_mv AT liangyang exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT mengge exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT dijin exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT dongxiaohe exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT huazhufu exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT jingwang exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
AT xiaochuncao exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel
_version_ 1725022858845880320