MCE-P: Scalable Maximum Clique Enumeration Using Apache Hama
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 104 === Maximum clique mining on extremely large graphs has applications in many fields, such as social networks, bioinformatics, and computational chemistry. Recent studies have addressed the problem with conventional MapReduce algorithms. Nev...
Main Authors: | Chieh-Hsuan Cheng, 鄭捷軒 |
---|---|
Other Authors: | Tai-Lin Chin |
Format: | Others |
Language: | en_US |
Published: | 2015 |
Online Access: | http://ndltd.ncl.edu.tw/handle/00189756932319713552 |
id |
ndltd-TW-104NTUS5392006 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTUS5392006 2017-10-29T04:34:40Z http://ndltd.ncl.edu.tw/handle/00189756932319713552 MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama 使用Apache Hama枚舉可擴展的最大團 Chieh-Hsuan Cheng 鄭捷軒 Master's National Taiwan University of Science and Technology Department of Computer Science and Information Engineering 104 Maximum clique mining on extremely large graphs has applications in many fields, such as social networks, bioinformatics, and computational chemistry. Recent studies have addressed the problem with conventional MapReduce algorithms. However, those algorithms use the parallel architecture of MapReduce only to partition the graph and still apply sequential algorithms to find the maximum clique in each subgraph, so the maximum clique search itself is not carried out in parallel. This thesis proposes a scheme that mines the maximum clique in a huge graph in parallel using Apache Hama, a general bulk synchronous parallel (BSP) computing engine on top of Hadoop. Every vertex iteratively executes the same procedure: it receives messages from its neighbors, processes them, and sends messages to its neighbors. In each iteration, vertices are collected into a clique until no further vertex can be added. The maximum cliques are then selected from among the collected cliques. Experimental results show that the proposed solution is more efficient and more scalable than existing MapReduce algorithms. Tai-Lin Chin 金台齡 2015 degree thesis ; thesis 36 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Taiwan University of Science and Technology === Department of Computer Science and Information Engineering === 104 === Maximum clique mining on extremely large graphs has applications in
many fields, such as social networks, bioinformatics, and computational chemistry. Recent
studies have addressed the problem with conventional MapReduce algorithms.
However, those algorithms use the parallel architecture of MapReduce only to
partition the graph and still apply sequential algorithms to find the
maximum clique in each subgraph, so the maximum clique search itself
is not carried out in parallel. This thesis proposes a scheme
that mines the maximum clique in a huge graph in parallel using Apache
Hama, a general bulk synchronous parallel (BSP) computing engine on top of
Hadoop. Every vertex iteratively executes the same procedure: it receives
messages from its neighbors, processes them, and sends messages to its
neighbors. In each iteration, vertices are collected into a clique until no
further vertex can be added. The maximum cliques are then selected from among
the collected cliques. Experimental results show that the proposed solution is more efficient
and more scalable than existing MapReduce algorithms.
|
author2 |
Tai-Lin Chin |
author_facet |
Tai-Lin Chin Chieh-Hsuan Cheng 鄭捷軒 |
author |
Chieh-Hsuan Cheng 鄭捷軒 |
spellingShingle |
Chieh-Hsuan Cheng 鄭捷軒 MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
author_sort |
Chieh-Hsuan Cheng |
title |
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
title_short |
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
title_full |
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
title_fullStr |
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
title_full_unstemmed |
MCE-P:Scalable Maximum Clique Enumeration Using Apache Hama |
title_sort |
mce-p:scalable maximum clique enumeration using apache hama |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/00189756932319713552 |
work_keys_str_mv |
AT chiehhsuancheng mcepscalablemaximumcliqueenumerationusingapachehama AT zhèngjiéxuān mcepscalablemaximumcliqueenumerationusingapachehama AT chiehhsuancheng shǐyòngapachehamaméijǔkěkuòzhǎndezuìdàtuán AT zhèngjiéxuān shǐyòngapachehamaméijǔkěkuòzhǎndezuìdàtuán |
_version_ |
1718558221268942848 |
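
The vertex-centric procedure summarized in the description (receive messages, process, send to neighbors, repeat until no vertex can be added) can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the thesis's MCE-P algorithm and not Apache Hama code: the function name `max_cliques_bsp`, the adjacency-dictionary input, and the rule of forwarding cliques only to higher-numbered neighbors (used here to avoid generating the same clique twice) are all hypothetical choices made for the example. It enumerates cliques level by level, so it is suitable only for small graphs.

```python
from collections import defaultdict


def max_cliques_bsp(adj):
    """Simulate vertex-centric BSP supersteps and return all maximum cliques.

    adj maps each vertex to the set of its neighbors (undirected graph).
    Vertex ids must be mutually comparable (e.g. all ints).
    Assumption: this is an illustrative level-wise scheme, not MCE-P itself.
    """
    # Superstep 0: every vertex holds the trivial clique containing itself.
    held = {v: [frozenset([v])] for v in adj}
    best = []
    while any(held.values()):
        # All cliques held in one superstep have the same size, and that
        # size grows by one per superstep, so the last non-empty level wins.
        best = [c for cliques in held.values() for c in cliques]

        # Send phase: each vertex forwards its cliques to higher-id neighbors.
        inbox = defaultdict(list)
        for v, cliques in held.items():
            for u in adj[v]:
                if u > v:
                    inbox[u].extend(cliques)

        # Barrier, then compute phase: a vertex joins a received clique
        # only if it is adjacent to every member of that clique.
        held = {
            u: [c | {u} for c in msgs if c <= adj[u]]
            for u, msgs in inbox.items()
        }
    return sorted(sorted(c) for c in best)


if __name__ == "__main__":
    # Triangle 1-2-3 with a pendant vertex 4 attached to vertex 3.
    graph = {1: {2, 3}, 2: {1, 3}, 3: {1, 2, 4}, 4: {3}}
    print(max_cliques_bsp(graph))
```

Each pass of the `while` loop corresponds to one superstep: messages are collected in `inbox`, the rebuild of `held` plays the role of the synchronization barrier plus the per-vertex compute step, and the loop stops when no vertex holds a clique that could be extended.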