Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems

博士 === 國立交通大學 === 電子工程學系 電子研究所 === 102 === Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the...

Full description

Bibliographic Details
Main Authors: Li, Kun-Chun, 李坤駿
Other Authors: Lai, Bo-Cheng
Format: Others
Language:en_US
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/wvexk9
id ndltd-TW-102NCTU5428144
record_format oai_dc
spelling ndltd-TW-102NCTU54281442019-05-15T21:50:57Z http://ndltd.ncl.edu.tw/handle/wvexk9 Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems 異質多核心平台實現巨量資料的貝氏序向切割之分析研究 Li, Kun-Chun 李坤駿 博士 國立交通大學 電子工程學系 電子研究所 102 Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the characteristics of the data space. Bayesian Sequential Partitioning (BSP) is a statistically effective density estimation algorithm for high dimensional data. However, BSP is computationally expensive due to complex statistical model, and data intensive when counting the large volume samples. This thesis proposes a high performance design of BSP by leveraging the powerful computation capability of a heterogeneous many-core system. A series of techniques are implemented on both algorithm flow and data management policies. With the proposed approaches, the performance bottleneck is alleviated and the runtime is significantly improved. The overall speedup of the BSP analysis on a heterogeneous many-core system can reach up to 155.3x, when compared with the reference design on a high-end CPU. Lai, Bo-Cheng 賴伯承 2014 學位論文 ; thesis 45 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 博士 === 國立交通大學 === 電子工程學系 電子研究所 === 102 === Uncovering information from the large volume of data in a timely manner has been an imperative task in the next wave of computing technologies. Estimating the density distribution of the high dimensional data samples is an effective method to comprehend the characteristics of the data space. Bayesian Sequential Partitioning (BSP) is a statistically effective density estimation algorithm for high dimensional data. However, BSP is computationally expensive due to complex statistical model, and data intensive when counting the large volume samples. This thesis proposes a high performance design of BSP by leveraging the powerful computation capability of a heterogeneous many-core system. A series of techniques are implemented on both algorithm flow and data management policies. With the proposed approaches, the performance bottleneck is alleviated and the runtime is significantly improved. The overall speedup of the BSP analysis on a heterogeneous many-core system can reach up to 155.3x, when compared with the reference design on a high-end CPU.
author2 Lai, Bo-Cheng
author_facet Lai, Bo-Cheng
Li, Kun-Chun
李坤駿
author Li, Kun-Chun
李坤駿
spellingShingle Li, Kun-Chun
李坤駿
Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
author_sort Li, Kun-Chun
title Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
title_short Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
title_full Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
title_fullStr Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
title_full_unstemmed Bayesian Sequential Partitioning for Big Data Analysis on Heterogeneous Manycore Systems
title_sort bayesian sequential partitioning for big data analysis on heterogeneous manycore systems
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/wvexk9
work_keys_str_mv AT likunchun bayesiansequentialpartitioningforbigdataanalysisonheterogeneousmanycoresystems
AT lǐkūnjùn bayesiansequentialpartitioningforbigdataanalysisonheterogeneousmanycoresystems
AT likunchun yìzhìduōhéxīnpíngtáishíxiànjùliàngzīliàodebèishìxùxiàngqiègēzhīfēnxīyánjiū
AT lǐkūnjùn yìzhìduōhéxīnpíngtáishíxiànjùliàngzīliàodebèishìxùxiàngqiègēzhīfēnxīyánjiū
_version_ 1719119896554504192