An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
碩士 === 國立交通大學 === 電子工程學系 電子研究所 === 104 === Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively tak...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/nuu8t2 |
id |
ndltd-TW-104NCTU5428155 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NCTU54281552019-05-15T23:08:42Z http://ndltd.ncl.edu.tw/handle/nuu8t2 An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms 針對 Hadoop 在嵌入式異質多核心平台之低功耗設計流程 Chen,Sheng-Yen 陳聖諺 碩士 國立交通大學 電子工程學系 電子研究所 104 Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively take full advantage of an embedded heterogeneous many-core platform. The main challenge stems from the mismatch of data collection and management paradigms between the Hadoop environment and embedded heterogeneous systems. To address the above design concerns, this paper proposes a workflow that enables Hadoop applications to efficiently leverage the distributed embedded heterogeneous many-core systems. By taking the same data layout of conventional Hadoop applications, the proposed flow introduces efficient manners to collect and manage the fine-grained data chunks. Using Principle Component Analysis (PCA) as an application driver, the proposed Hadoop design on a Tegra K1 cluster has achieved 6.4x performance enhancement when running the PCA analysis on an input matrix of 16K × 16K data. The proposed design also demonstrated much better energy efficiency when compared with the designs on conventional PC-based clusters. 賴伯承 2016 學位論文 ; thesis 39 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電子工程學系 電子研究所 === 104 === Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively take full advantage of an embedded heterogeneous many-core platform. The main challenge stems from the mismatch of data collection and management paradigms between the Hadoop environment and embedded heterogeneous systems. To address the above design concerns, this paper proposes a workflow that enables Hadoop applications to efficiently leverage the distributed embedded heterogeneous many-core systems. By taking the same data layout of conventional Hadoop applications, the proposed flow introduces efficient manners to collect and manage the fine-grained data chunks. Using Principle Component Analysis (PCA) as an application driver, the proposed Hadoop design on a Tegra K1 cluster has achieved 6.4x performance enhancement when running the PCA analysis on an input matrix of 16K × 16K data. The proposed design also demonstrated much better energy efficiency when compared with the designs on conventional PC-based clusters.
|
author2 |
賴伯承 |
author_facet |
賴伯承 Chen,Sheng-Yen 陳聖諺 |
author |
Chen,Sheng-Yen 陳聖諺 |
spellingShingle |
Chen,Sheng-Yen 陳聖諺 An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
author_sort |
Chen,Sheng-Yen |
title |
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
title_short |
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
title_full |
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
title_fullStr |
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
title_full_unstemmed |
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms |
title_sort |
efficient design flow for hadoop on embedded heterogeneous platforms |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/nuu8t2 |
work_keys_str_mv |
AT chenshengyen anefficientdesignflowforhadooponembeddedheterogeneousplatforms AT chénshèngyàn anefficientdesignflowforhadooponembeddedheterogeneousplatforms AT chenshengyen zhēnduìhadoopzàiqiànrùshìyìzhìduōhéxīnpíngtáizhīdīgōnghàoshèjìliúchéng AT chénshèngyàn zhēnduìhadoopzàiqiànrùshìyìzhìduōhéxīnpíngtáizhīdīgōnghàoshèjìliúchéng AT chenshengyen efficientdesignflowforhadooponembeddedheterogeneousplatforms AT chénshèngyàn efficientdesignflowforhadooponembeddedheterogeneousplatforms |
_version_ |
1719140580113514496 |