Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.
Due to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-li...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2017-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5497956?pdf=render |
id |
doaj-ab06ca605d9245d796fd2bdd71cf90a7 |
---|---|
record_format |
Article |
spelling |
doaj-ab06ca605d9245d796fd2bdd71cf90a72020-11-25T01:45:45ZengPublic Library of Science (PLoS)PLoS ONE1932-62032017-01-01127e017802910.1371/journal.pone.0178029Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model.Liang YangMeng GeDi JinDongxiao HeHuazhu FuJing WangXiaochun CaoDue to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-link constraints, which can force the exclusion between nodes. In this paper, we are interested in understanding the role of cannot-link constraints and effectively encoding pairwise constraints. Towards these goals, we define an integral generative process jointly considering the network topology, must-link and cannot-link constraints. We propose to characterize this process as a Multi-variance Mixed Gaussian Generative (MMGG) Model to address diverse degrees of confidences that exist in network topology and pairwise constraints and formulate it as a weighted nonnegative matrix factorization problem. The experiments on artificial and real-world networks not only illustrate the superiority of our proposed MMGG, but also, most importantly, reveal the roles of pairwise constraints. That is, though the must-link is more important than cannot-link when either of them is available, both must-link and cannot-link are equally important when both of them are available. To the best of our knowledge, this is the first work on discovering and exploring the importance of cannot-link constraints in semi-supervised community detection.http://europepmc.org/articles/PMC5497956?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Liang Yang Meng Ge Di Jin Dongxiao He Huazhu Fu Jing Wang Xiaochun Cao |
spellingShingle |
Liang Yang Meng Ge Di Jin Dongxiao He Huazhu Fu Jing Wang Xiaochun Cao Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. PLoS ONE |
author_facet |
Liang Yang Meng Ge Di Jin Dongxiao He Huazhu Fu Jing Wang Xiaochun Cao |
author_sort |
Liang Yang |
title |
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. |
title_short |
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. |
title_full |
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. |
title_fullStr |
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. |
title_full_unstemmed |
Exploring the roles of cannot-link constraint in community detection via Multi-variance Mixed Gaussian Generative Model. |
title_sort |
exploring the roles of cannot-link constraint in community detection via multi-variance mixed gaussian generative model. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2017-01-01 |
description |
Due to the demand for performance improvement and the existence of prior information, semi-supervised community detection with pairwise constraints becomes a hot topic. Most existing methods have been successfully encoding the must-link constraints, but neglect the opposite ones, i.e., the cannot-link constraints, which can force the exclusion between nodes. In this paper, we are interested in understanding the role of cannot-link constraints and effectively encoding pairwise constraints. Towards these goals, we define an integral generative process jointly considering the network topology, must-link and cannot-link constraints. We propose to characterize this process as a Multi-variance Mixed Gaussian Generative (MMGG) Model to address diverse degrees of confidences that exist in network topology and pairwise constraints and formulate it as a weighted nonnegative matrix factorization problem. The experiments on artificial and real-world networks not only illustrate the superiority of our proposed MMGG, but also, most importantly, reveal the roles of pairwise constraints. That is, though the must-link is more important than cannot-link when either of them is available, both must-link and cannot-link are equally important when both of them are available. To the best of our knowledge, this is the first work on discovering and exploring the importance of cannot-link constraints in semi-supervised community detection. |
url |
http://europepmc.org/articles/PMC5497956?pdf=render |
work_keys_str_mv |
AT liangyang exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT mengge exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT dijin exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT dongxiaohe exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT huazhufu exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT jingwang exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel AT xiaochuncao exploringtherolesofcannotlinkconstraintincommunitydetectionviamultivariancemixedgaussiangenerativemodel |
_version_ |
1725022858845880320 |