An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms

碩士 === 國立交通大學 === 電子工程學系 電子研究所 === 104 === Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively tak...

Full description

Bibliographic Details
Main Authors: Chen,Sheng-Yen, 陳聖諺
Other Authors: 賴伯承
Format: Others
Language:en_US
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/nuu8t2
id ndltd-TW-104NCTU5428155
record_format oai_dc
spelling ndltd-TW-104NCTU54281552019-05-15T23:08:42Z http://ndltd.ncl.edu.tw/handle/nuu8t2 An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms 針對 Hadoop 在嵌入式異質多核心平台之低功耗設計流程 Chen,Sheng-Yen 陳聖諺 碩士 國立交通大學 電子工程學系 電子研究所 104 Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively take full advantage of an embedded heterogeneous many-core platform. The main challenge stems from the mismatch of data collection and management paradigms between the Hadoop environment and embedded heterogeneous systems. To address the above design concerns, this paper proposes a workflow that enables Hadoop applications to efficiently leverage the distributed embedded heterogeneous many-core systems. By taking the same data layout of conventional Hadoop applications, the proposed flow introduces efficient manners to collect and manage the fine-grained data chunks. Using Principle Component Analysis (PCA) as an application driver, the proposed Hadoop design on a Tegra K1 cluster has achieved 6.4x performance enhancement when running the PCA analysis on an input matrix of 16K × 16K data. The proposed design also demonstrated much better energy efficiency when compared with the designs on conventional PC-based clusters. 賴伯承 2016 學位論文 ; thesis 39 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電子工程學系 電子研究所 === 104 === Hadoop is a widely adopted distributed processing framework. The Hadoop framework achieves good portability by assuming each computing node a conventional CPU-based system with local memory. However, the current flow of this framework cannot effectively take full advantage of an embedded heterogeneous many-core platform. The main challenge stems from the mismatch of data collection and management paradigms between the Hadoop environment and embedded heterogeneous systems. To address the above design concerns, this paper proposes a workflow that enables Hadoop applications to efficiently leverage the distributed embedded heterogeneous many-core systems. By taking the same data layout of conventional Hadoop applications, the proposed flow introduces efficient manners to collect and manage the fine-grained data chunks. Using Principle Component Analysis (PCA) as an application driver, the proposed Hadoop design on a Tegra K1 cluster has achieved 6.4x performance enhancement when running the PCA analysis on an input matrix of 16K × 16K data. The proposed design also demonstrated much better energy efficiency when compared with the designs on conventional PC-based clusters.
author2 賴伯承
author_facet 賴伯承
Chen,Sheng-Yen
陳聖諺
author Chen,Sheng-Yen
陳聖諺
spellingShingle Chen,Sheng-Yen
陳聖諺
An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
author_sort Chen,Sheng-Yen
title An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
title_short An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
title_full An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
title_fullStr An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
title_full_unstemmed An Efficient Design Flow for Hadoop on Embedded Heterogeneous Platforms
title_sort efficient design flow for hadoop on embedded heterogeneous platforms
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/nuu8t2
work_keys_str_mv AT chenshengyen anefficientdesignflowforhadooponembeddedheterogeneousplatforms
AT chénshèngyàn anefficientdesignflowforhadooponembeddedheterogeneousplatforms
AT chenshengyen zhēnduìhadoopzàiqiànrùshìyìzhìduōhéxīnpíngtáizhīdīgōnghàoshèjìliúchéng
AT chénshèngyàn zhēnduìhadoopzàiqiànrùshìyìzhìduōhéxīnpíngtáizhīdīgōnghàoshèjìliúchéng
AT chenshengyen efficientdesignflowforhadooponembeddedheterogeneousplatforms
AT chénshèngyàn efficientdesignflowforhadooponembeddedheterogeneousplatforms
_version_ 1719140580113514496