A stochastic model for detecting overlapping and hierarchical community structure.

Community detection is a fundamental problem in the analysis of complex networks. Recently, many researchers have concentrated on the detection of overlapping communities, where a vertex may belong to more than one community. However, most current methods require the number (or the size) of the comm...

Full description

Bibliographic Details
Main Authors: Xiaochun Cao, Xiao Wang, Di Jin, Xiaojie Guo, Xianchao Tang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4379187?pdf=render
id doaj-9058ef515a7843ca89efe3fa756e2bef
record_format Article
spelling doaj-9058ef515a7843ca89efe3fa756e2bef2020-11-25T00:24:21ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01103e011917110.1371/journal.pone.0119171A stochastic model for detecting overlapping and hierarchical community structure.Xiaochun CaoXiao WangDi JinXiaojie GuoXianchao TangCommunity detection is a fundamental problem in the analysis of complex networks. Recently, many researchers have concentrated on the detection of overlapping communities, where a vertex may belong to more than one community. However, most current methods require the number (or the size) of the communities as a priori information, which is usually unavailable in real-world networks. Thus, a practical algorithm should not only find the overlapping community structure, but also automatically determine the number of communities. Furthermore, it is preferable if this method is able to reveal the hierarchical structure of networks as well. In this work, we firstly propose a generative model that employs a nonnegative matrix factorization (NMF) formulization with a l(2,1) norm regularization term, balanced by a resolution parameter. The NMF has the nature that provides overlapping community structure by assigning soft membership variables to each vertex; the l(2,1) regularization term is a technique of group sparsity which can automatically determine the number of communities by penalizing too many nonempty communities; and hence the resolution parameter enables us to explore the hierarchical structure of networks. Thereafter, we derive the multiplicative update rule to learn the model parameters, and offer the proof of its correctness. Finally, we test our approach on a variety of synthetic and real-world networks, and compare it with some state-of-the-art algorithms. The results validate the superior performance of our new method.http://europepmc.org/articles/PMC4379187?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Xiaochun Cao
Xiao Wang
Di Jin
Xiaojie Guo
Xianchao Tang
spellingShingle Xiaochun Cao
Xiao Wang
Di Jin
Xiaojie Guo
Xianchao Tang
A stochastic model for detecting overlapping and hierarchical community structure.
PLoS ONE
author_facet Xiaochun Cao
Xiao Wang
Di Jin
Xiaojie Guo
Xianchao Tang
author_sort Xiaochun Cao
title A stochastic model for detecting overlapping and hierarchical community structure.
title_short A stochastic model for detecting overlapping and hierarchical community structure.
title_full A stochastic model for detecting overlapping and hierarchical community structure.
title_fullStr A stochastic model for detecting overlapping and hierarchical community structure.
title_full_unstemmed A stochastic model for detecting overlapping and hierarchical community structure.
title_sort stochastic model for detecting overlapping and hierarchical community structure.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description Community detection is a fundamental problem in the analysis of complex networks. Recently, many researchers have concentrated on the detection of overlapping communities, where a vertex may belong to more than one community. However, most current methods require the number (or the size) of the communities as a priori information, which is usually unavailable in real-world networks. Thus, a practical algorithm should not only find the overlapping community structure, but also automatically determine the number of communities. Furthermore, it is preferable if this method is able to reveal the hierarchical structure of networks as well. In this work, we firstly propose a generative model that employs a nonnegative matrix factorization (NMF) formulization with a l(2,1) norm regularization term, balanced by a resolution parameter. The NMF has the nature that provides overlapping community structure by assigning soft membership variables to each vertex; the l(2,1) regularization term is a technique of group sparsity which can automatically determine the number of communities by penalizing too many nonempty communities; and hence the resolution parameter enables us to explore the hierarchical structure of networks. Thereafter, we derive the multiplicative update rule to learn the model parameters, and offer the proof of its correctness. Finally, we test our approach on a variety of synthetic and real-world networks, and compare it with some state-of-the-art algorithms. The results validate the superior performance of our new method.
url http://europepmc.org/articles/PMC4379187?pdf=render
work_keys_str_mv AT xiaochuncao astochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xiaowang astochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT dijin astochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xiaojieguo astochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xianchaotang astochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xiaochuncao stochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xiaowang stochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT dijin stochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xiaojieguo stochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
AT xianchaotang stochasticmodelfordetectingoverlappingandhierarchicalcommunitystructure
_version_ 1725352435219693568