Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams

碩士 === 國立臺北科技大學 === 資訊工程系研究所 === 105 === How to effectively derive the valuable data in Big Data is one of the important research issues in Data Science. The data we collect may be uncertain, imprecise, or a statistic due to the device or transmission error. In addition, the data may change as time...

Full description

Bibliographic Details
Main Authors: Tien-Chun Wang, 王天駿
Other Authors: Chuan-Ming Liu
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/w46b7n
id ndltd-TW-105TIT05392043
record_format oai_dc
spelling ndltd-TW-105TIT053920432019-05-15T23:53:43Z http://ndltd.ncl.edu.tw/handle/w46b7n Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams 多重不確定資料流上機率性前k個征服者搜尋之探討 Tien-Chun Wang 王天駿 碩士 國立臺北科技大學 資訊工程系研究所 105 How to effectively derive the valuable data in Big Data is one of the important research issues in Data Science. The data we collect may be uncertain, imprecise, or a statistic due to the device or transmission error. In addition, the data may change as time evolves and we refer to such a data set as an uncertain data stream that has, velocity, veracity, and volume properties. This paper explores the top-k dominating query process on uncertain data streams and employs the parallel computation to facilitate the query process. The challenges include how to quickly update the result and reduce the computation cost for processing uncertainty. By referring the related existing papers for certain data, we provide an effective top-k dominating query process on uncertain data streams in terms of time and space and the provided approach can be parallelized easily. After discussing the properties of the proposed approach, we validate our methods through extensive simulated experiments. The experimental results indicates that our algorithms can avoid the unnecessary computation effectively and reduce lots of communication throughput between servers, thus achieving the objective of updating the results quickly. Chuan-Ming Liu 劉傳銘 2017 學位論文 ; thesis 0 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北科技大學 === 資訊工程系研究所 === 105 === How to effectively derive the valuable data in Big Data is one of the important research issues in Data Science. The data we collect may be uncertain, imprecise, or a statistic due to the device or transmission error. In addition, the data may change as time evolves and we refer to such a data set as an uncertain data stream that has, velocity, veracity, and volume properties. This paper explores the top-k dominating query process on uncertain data streams and employs the parallel computation to facilitate the query process. The challenges include how to quickly update the result and reduce the computation cost for processing uncertainty. By referring the related existing papers for certain data, we provide an effective top-k dominating query process on uncertain data streams in terms of time and space and the provided approach can be parallelized easily. After discussing the properties of the proposed approach, we validate our methods through extensive simulated experiments. The experimental results indicates that our algorithms can avoid the unnecessary computation effectively and reduce lots of communication throughput between servers, thus achieving the objective of updating the results quickly.
author2 Chuan-Ming Liu
author_facet Chuan-Ming Liu
Tien-Chun Wang
王天駿
author Tien-Chun Wang
王天駿
spellingShingle Tien-Chun Wang
王天駿
Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
author_sort Tien-Chun Wang
title Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
title_short Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
title_full Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
title_fullStr Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
title_full_unstemmed Exploring Probabilistic Top-k Dominating Query over Multiple Uncertain Data Streams
title_sort exploring probabilistic top-k dominating query over multiple uncertain data streams
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/w46b7n
work_keys_str_mv AT tienchunwang exploringprobabilistictopkdominatingqueryovermultipleuncertaindatastreams
AT wángtiānjùn exploringprobabilistictopkdominatingqueryovermultipleuncertaindatastreams
AT tienchunwang duōzhòngbùquèdìngzīliàoliúshàngjīlǜxìngqiánkgèzhēngfúzhěsōuxúnzhītàntǎo
AT wángtiānjùn duōzhòngbùquèdìngzīliàoliúshàngjīlǜxìngqiánkgèzhēngfúzhěsōuxúnzhītàntǎo
_version_ 1719156245172060160