MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama

碩士 === 國立臺灣科技大學 === 資訊工程系 === 104 === The maximum clique mining problem for extremely large graphs has been used in many fields, such as social network, bioinformatics and computational chemistry. Recently, some studies in the literature solve the problem using conventional MapReduce algorithms. Nev...

Full description

Bibliographic Details
Main Authors: Chieh-Hsuan Cheng, 鄭捷軒
Other Authors: Tai-Lin Chin
Format: Others
Language:en_US
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/00189756932319713552
id ndltd-TW-104NTUS5392006
record_format oai_dc
spelling ndltd-TW-104NTUS53920062017-10-29T04:34:40Z http://ndltd.ncl.edu.tw/handle/00189756932319713552 MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama 使用Apache Hama枚舉可擴展的最大團 Chieh-Hsuan Cheng 鄭捷軒 碩士 國立臺灣科技大學 資訊工程系 104 The maximum clique mining problem for extremely large graphs has been used in many fields, such as social network, bioinformatics and computational chemistry. Recently, some studies in the literature solve the problem using conventional MapReduce algorithms. Nevertheless, those algorithms just use the parallel architecture of MapReduce processing to partition the graph, but still apply sequential algorithms to find the maximum clique in a subgraph. The problem of mining the maximum clique in a graph is not actually solved in a parallel fashion. This paper proposes an innovative scheme to mine the maximum clique in a huge graph by a parallel technique based on Apache Hama, which is a general bulk synchronous parallel (BSP) computing engine on top of Hadoop. Essentially, every vertex iteratively executes the same procedure, including receiving messages from its neighbors, processing the tasks and sending messages to its neighbors. The vertices in a particular clique will be collected in each iteration until no vertex can be added. The maximum cliques are determined among those cliques at the end. Our experimental results demonstrate that our proposed solution is more efficient and more scalable than the existing MapReduce algorithms. Tai-Lin Chin 金台齡 2015 學位論文 ; thesis 36 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 104 === The maximum clique mining problem for extremely large graphs has been used in many fields, such as social network, bioinformatics and computational chemistry. Recently, some studies in the literature solve the problem using conventional MapReduce algorithms. Nevertheless, those algorithms just use the parallel architecture of MapReduce processing to partition the graph, but still apply sequential algorithms to find the maximum clique in a subgraph. The problem of mining the maximum clique in a graph is not actually solved in a parallel fashion. This paper proposes an innovative scheme to mine the maximum clique in a huge graph by a parallel technique based on Apache Hama, which is a general bulk synchronous parallel (BSP) computing engine on top of Hadoop. Essentially, every vertex iteratively executes the same procedure, including receiving messages from its neighbors, processing the tasks and sending messages to its neighbors. The vertices in a particular clique will be collected in each iteration until no vertex can be added. The maximum cliques are determined among those cliques at the end. Our experimental results demonstrate that our proposed solution is more efficient and more scalable than the existing MapReduce algorithms.
author2 Tai-Lin Chin
author_facet Tai-Lin Chin
Chieh-Hsuan Cheng
鄭捷軒
author Chieh-Hsuan Cheng
鄭捷軒
spellingShingle Chieh-Hsuan Cheng
鄭捷軒
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
author_sort Chieh-Hsuan Cheng
title MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
title_short MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
title_full MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
title_fullStr MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
title_full_unstemmed MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama
title_sort mce-p:scalable maximum clique enumeration using apache hama
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/00189756932319713552
work_keys_str_mv AT chiehhsuancheng mcepscalablemaximumcliqueenumerationusingapachehama
AT zhèngjiéxuān mcepscalablemaximumcliqueenumerationusingapachehama
AT chiehhsuancheng shǐyòngapachehamaméijǔkěkuòzhǎndezuìdàtuán
AT zhèngjiéxuān shǐyòngapachehamaméijǔkěkuòzhǎndezuìdàtuán
_version_ 1718558221268942848