Identifying influential nodes in large-scale directed networks: the role of clustering.

Identifying influential nodes in very large-scale directed networks is a big challenge relevant to disparate applications, such as accelerating information propagation, controlling rumors and diseases, designing search engines, and understanding hierarchical organization of social and biological net...

Full description

Bibliographic Details
Main Authors: Duan-Bing Chen, Hui Gao, Linyuan Lü, Tao Zhou
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3814409?pdf=render
id doaj-74dce49a05fc43c185e46a6756bff3fd
record_format Article
spelling doaj-74dce49a05fc43c185e46a6756bff3fd2020-11-25T01:18:47ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-01810e7745510.1371/journal.pone.0077455Identifying influential nodes in large-scale directed networks: the role of clustering.Duan-Bing ChenHui GaoLinyuan LüTao ZhouIdentifying influential nodes in very large-scale directed networks is a big challenge relevant to disparate applications, such as accelerating information propagation, controlling rumors and diseases, designing search engines, and understanding hierarchical organization of social and biological networks. Known methods range from node centralities, such as degree, closeness and betweenness, to diffusion-based processes, like PageRank and LeaderRank. Some of these methods already take into account the influences of a node's neighbors but do not directly make use of the interactions among it's neighbors. Local clustering is known to have negative impacts on the information spreading. We further show empirically that it also plays a negative role in generating local connections. Inspired by these facts, we propose a local ranking algorithm named ClusterRank, which takes into account not only the number of neighbors and the neighbors' influences, but also the clustering coefficient. Subject to the susceptible-infected-recovered (SIR) spreading model with constant infectivity, experimental results on two directed networks, a social network extracted from delicious.com and a large-scale short-message communication network, demonstrate that the ClusterRank outperforms some benchmark algorithms such as PageRank and LeaderRank. Furthermore, ClusterRank can also be applied to undirected networks where the superiority of ClusterRank is significant compared with degree centrality and k-core decomposition. In addition, ClusterRank, only making use of local information, is much more efficient than global methods: It takes only 191 seconds for a network with about [Formula: see text] nodes, more than 15 times faster than PageRank.http://europepmc.org/articles/PMC3814409?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Duan-Bing Chen
Hui Gao
Linyuan Lü
Tao Zhou
spellingShingle Duan-Bing Chen
Hui Gao
Linyuan Lü
Tao Zhou
Identifying influential nodes in large-scale directed networks: the role of clustering.
PLoS ONE
author_facet Duan-Bing Chen
Hui Gao
Linyuan Lü
Tao Zhou
author_sort Duan-Bing Chen
title Identifying influential nodes in large-scale directed networks: the role of clustering.
title_short Identifying influential nodes in large-scale directed networks: the role of clustering.
title_full Identifying influential nodes in large-scale directed networks: the role of clustering.
title_fullStr Identifying influential nodes in large-scale directed networks: the role of clustering.
title_full_unstemmed Identifying influential nodes in large-scale directed networks: the role of clustering.
title_sort identifying influential nodes in large-scale directed networks: the role of clustering.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Identifying influential nodes in very large-scale directed networks is a big challenge relevant to disparate applications, such as accelerating information propagation, controlling rumors and diseases, designing search engines, and understanding hierarchical organization of social and biological networks. Known methods range from node centralities, such as degree, closeness and betweenness, to diffusion-based processes, like PageRank and LeaderRank. Some of these methods already take into account the influences of a node's neighbors but do not directly make use of the interactions among it's neighbors. Local clustering is known to have negative impacts on the information spreading. We further show empirically that it also plays a negative role in generating local connections. Inspired by these facts, we propose a local ranking algorithm named ClusterRank, which takes into account not only the number of neighbors and the neighbors' influences, but also the clustering coefficient. Subject to the susceptible-infected-recovered (SIR) spreading model with constant infectivity, experimental results on two directed networks, a social network extracted from delicious.com and a large-scale short-message communication network, demonstrate that the ClusterRank outperforms some benchmark algorithms such as PageRank and LeaderRank. Furthermore, ClusterRank can also be applied to undirected networks where the superiority of ClusterRank is significant compared with degree centrality and k-core decomposition. In addition, ClusterRank, only making use of local information, is much more efficient than global methods: It takes only 191 seconds for a network with about [Formula: see text] nodes, more than 15 times faster than PageRank.
url http://europepmc.org/articles/PMC3814409?pdf=render
work_keys_str_mv AT duanbingchen identifyinginfluentialnodesinlargescaledirectednetworkstheroleofclustering
AT huigao identifyinginfluentialnodesinlargescaledirectednetworkstheroleofclustering
AT linyuanlu identifyinginfluentialnodesinlargescaledirectednetworkstheroleofclustering
AT taozhou identifyinginfluentialnodesinlargescaledirectednetworkstheroleofclustering
_version_ 1725140336659922944