Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
博士 === 國立交通大學 === 電子工程學系 電子研究所 === 102 === Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/wvexk9 |
id |
ndltd-TW-102NCTU5428144 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NCTU54281442019-05-15T21:50:57Z http://ndltd.ncl.edu.tw/handle/wvexk9 Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems 異質多核心平台實現巨量資料的貝氏序向切割之分析研究 Li, Kun-Chun 李坤駿 博士 國立交通大學 電子工程學系 電子研究所 102 Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the characteristics of the data space. Bayesian Sequential Partitioning (BSP) is a statistically effective density estimation algorithm for high dimensional data. However, BSP is computationally expensive due to complex statistical model, and data intensive when counting the large volume samples. This thesis proposes a high performance design of BSP by leveraging the powerful computation capability of a heterogeneous many-core system. A series of techniques are implemented on both algorithm flow and data management policies. With the proposed approaches, the performance bottleneck is alleviated and the runtime is significantly improved. The overall speedup of the BSP analysis on a heterogeneous many-core system can reach up to 155.3x, when compared with the reference design on a high-end CPU. Lai, Bo-Cheng 賴伯承 2014 學位論文 ; thesis 45 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立交通大學 === 電子工程學系 電子研究所 === 102 === Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the characteristics of the data space. Bayesian Sequential Partitioning (BSP) is a statistically effective density estimation algorithm for high dimensional data. However, BSP is computationally expensive due to complex statistical model, and data intensive when counting the large volume samples. This thesis proposes a high performance design of BSP by leveraging the powerful computation capability of a heterogeneous many-core system. A series of techniques are implemented on both algorithm flow and data management policies. With the proposed approaches, the performance bottleneck is alleviated and the runtime is significantly improved. The overall speedup of the BSP analysis on a heterogeneous many-core system can reach up to 155.3x, when compared with the reference design on a high-end CPU.
|
author2 |
Lai, Bo-Cheng |
author_facet |
Lai, Bo-Cheng Li, Kun-Chun 李坤駿 |
author |
Li, Kun-Chun 李坤駿 |
spellingShingle |
Li, Kun-Chun 李坤駿 Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
author_sort |
Li, Kun-Chun |
title |
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
title_short |
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
title_full |
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
title_fullStr |
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
title_full_unstemmed |
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems |
title_sort |
bayesian sequential partitioning for big data analysis on heterogeneous manycore systems |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/wvexk9 |
work_keys_str_mv |
AT likunchun bayesiansequentialpartitioningforbigdataanalysisonheterogeneousmanycoresystems AT lǐkūnjùn bayesiansequentialpartitioningforbigdataanalysisonheterogeneousmanycoresystems AT likunchun yìzhìduōhéxīnpíngtáishíxiànjùliàngzīliàodebèishìxùxiàngqiègēzhīfēnxīyánjiū AT lǐkūnjùn yìzhìduōhéxīnpíngtáishíxiànjùliàngzīliàodebèishìxùxiàngqiègēzhīfēnxīyánjiū |
_version_ |
1719119896554504192 |