Research of Subgraph Estimation Page Rank Algorithm for Web Page Rank

The traditional PageRank algorithm can not efficiently perform large data Webpage scheduling problem. This paper proposes an accelerated algorithm named topK-Rank,which is based on PageRank on the MapReduce platform. It can find top k nodes efficiently for a given graph without sacrificing accuracy....

Full description

Bibliographic Details
Main Authors: LI Lan-yin, ZHOU Qiu-Li, KONG Yin, DONG Yi-ming
Format: Article
Language:zho
Published: Harbin University of Science and Technology Publications 2017-04-01
Series:Journal of Harbin University of Science and Technology
Subjects:
Description
Summary:The traditional PageRank algorithm can not efficiently perform large data Webpage scheduling problem. This paper proposes an accelerated algorithm named topK-Rank,which is based on PageRank on the MapReduce platform. It can find top k nodes efficiently for a given graph without sacrificing accuracy. In order to identify top k nodes,topK-Rank algorithm prunes unnecessary nodes and edges in each iteration to dynamically construct subgraphs,and iteratively estimates lower/upper bounds of PageRank scores through subgraphs. Theoretical analysis shows that this method guarantees result exactness. Experiments show that topK-Rank algorithm can find k nodes much faster than the existing approaches.
ISSN:1007-2683