Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams
Master's === 國立清華大學 (National Tsing Hua University) === 資訊系統與應用研究所 (Institute of Information Systems and Applications) === 94 === A data stream is a real-time, continuous, ordered sequence of data items, and it is a widely used format for handling large amounts of dynamic data. Data streams have two main characteristics: dynamic content and unbounded storage requirements. We n...
Main Authors: | Yi-Kai Peng, 彭義凱 |
---|---|
Other Authors: | Yeh-Ching Chung |
Format: | Others |
Language: | en_US |
Published: | 2006 |
Online Access: | http://ndltd.ncl.edu.tw/handle/60091574710726087517 |
id |
ndltd-TW-094NTHU5394008 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094NTHU5394008 2016-06-01T04:14:41Z http://ndltd.ncl.edu.tw/handle/60091574710726087517 Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams 應用曲線擬合建立資料串流之摘要結構 Yi-Kai Peng 彭義凱 Master's 國立清華大學 (National Tsing Hua University) 資訊系統與應用研究所 (Institute of Information Systems and Applications) 94 A data stream is a real-time, continuous, ordered sequence of data items, and it is a widely used format for handling large amounts of dynamic data. Data streams have two main characteristics: dynamic content and unbounded storage requirements. We need to deal with both issues while processing data streams. For dynamic content, approximate query answering is a widely used approach. For unbounded storage, several data structures have been proposed that summarize a data stream while keeping the required storage space small. A synopsis is one such data structure: it summarizes the data stream so that query algorithms can return approximate answers from the summarized information it stores. In this thesis, we use curve fitting to construct the synopsis of a data stream in the form of a curve expressed by a polynomial function. We also propose algorithms for constructing the synopsis and for querying the data stream. We prove that the storage space required by the proposed method is O(log N). Experimental results show that our approach can achieve 95% accuracy on data contents for the queries. Yeh-Ching Chung 鍾葉青 2006 學位論文 ; thesis 25 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === 國立清華大學 (National Tsing Hua University) === 資訊系統與應用研究所 (Institute of Information Systems and Applications) === 94 === A data stream is a real-time, continuous, ordered sequence of data items, and it is a widely used format for handling large amounts of dynamic data. Data streams have two main characteristics: dynamic content and unbounded storage requirements. We need to deal with both issues while processing data streams. For dynamic content, approximate query answering is a widely used approach. For unbounded storage, several data structures have been proposed that summarize a data stream while keeping the required storage space small. A synopsis is one such data structure: it summarizes the data stream so that query algorithms can return approximate answers from the summarized information it stores. In this thesis, we use curve fitting to construct the synopsis of a data stream in the form of a curve expressed by a polynomial function. We also propose algorithms for constructing the synopsis and for querying the data stream. We prove that the storage space required by the proposed method is O(log N). Experimental results show that our approach can achieve 95% accuracy on data contents for the queries.
|
author2 |
Yeh-Ching Chung |
author_facet |
Yeh-Ching Chung Yi-Kai Peng 彭義凱 |
author |
Yi-Kai Peng 彭義凱 |
spellingShingle |
Yi-Kai Peng 彭義凱 Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
author_sort |
Yi-Kai Peng |
title |
Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
title_short |
Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
title_full |
Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
title_fullStr |
Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
title_full_unstemmed |
Applying Curve Fitting Techniques to Construct the Synopsis of Data Streams |
title_sort |
applying curve fitting techniques to construct the synopsis of data streams |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/60091574710726087517 |
work_keys_str_mv |
AT yikaipeng applyingcurvefittingtechniquestoconstructthesynopsisofdatastreams AT péngyìkǎi applyingcurvefittingtechniquestoconstructthesynopsisofdatastreams AT yikaipeng yīngyòngqūxiànnǐhéjiànlìzīliàochuànliúzhīzhāiyàojiégòu AT péngyìkǎi yīngyòngqūxiànnǐhéjiànlìzīliàochuànliúzhīzhāiyàojiégòu |
_version_ |
1718287360647495680 |
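
The description in this record says the synopsis is a curve expressed by a polynomial function, from which approximate answers are read back. The record does not give the thesis's algorithms, so the following is only a minimal sketch of the general idea, under several assumptions: a buffered window of stream values is fitted by ordinary least squares, only the coefficients are kept, and a point query evaluates the polynomial. The names `fit_polynomial` and `evaluate`, the fixed degree, and the use of item positions as x-values are all hypothetical, and the sketch does not reproduce the O(log N) construction the abstract refers to.

```python
from fractions import Fraction


def fit_polynomial(values, degree):
    """Least-squares fit of a polynomial to the points (i, values[i]).

    Returns coefficients [c0, c1, ..., c_degree], lowest power first.
    The normal equations are solved with exact rational arithmetic,
    which keeps this illustration free of rounding problems.
    """
    if len(values) <= degree:
        raise ValueError("need more data points than the polynomial degree")
    n = degree + 1
    xs = [Fraction(i) for i in range(len(values))]
    ys = [Fraction(v) for v in values]
    # Normal equations: a[r][c] = sum(x^(r+c)), b[r] = sum(y * x^r).
    a = [[sum(x ** (r + c) for x in xs) for c in range(n)] for r in range(n)]
    b = [sum(y * x ** r for x, y in zip(xs, ys)) for r in range(n)]
    # Gauss-Jordan elimination with row swaps for zero pivots.
    for col in range(n):
        pivot = next(r for r in range(col, n) if a[r][col] != 0)
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(n):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [ar - factor * ac for ar, ac in zip(a[r], a[col])]
                b[r] -= factor * b[col]
    return [float(b[i] / a[i][i]) for i in range(n)]


def evaluate(coeffs, x):
    """Evaluate the polynomial at x using Horner's rule."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result


# Hypothetical window of 50 stream items that happen to follow x^2 - 3x + 2.
window = [i * i - 3 * i + 2 for i in range(50)]
synopsis = fit_polynomial(window, degree=2)  # three stored coefficients
approx = evaluate(synopsis, 10)              # approximate value of item 10
```

In this sketch the 50 buffered values are replaced by three coefficients, which is the storage trade-off a polynomial synopsis aims for. Real stream data would not lie exactly on a polynomial, so the answers would be approximate, and the choice of degree and window size would govern the accuracy.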