Mining Streaming Data with Concept Drifts
碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 100 === Data mining frequently uses machine learning methods to process data, and these methods need to learn from training data so that they can make predictions on new data. Traditional data mining research on streamed data usually assumes that the data distribution...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Online Access: | http://ndltd.ncl.edu.tw/handle/64960926653396805165 |
id |
ndltd-TW-100NCYU5392001 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCYU53920012015-10-14T04:07:04Z http://ndltd.ncl.edu.tw/handle/64960926653396805165 Mining Streaming Data with Concept Drifts 探勘觀念漂移的串流資料 蘇郁喬 碩士 國立嘉義大學 資訊工程學系研究所 100 Data mining frequently uses machine learning methods to process data, and these methods need to learn from training data so that they can make predictions on new data. Traditional data mining research on streamed data usually assumes that the data distribution is stable. In the real world, however, concept drifts may occur in the continuously incoming data over time. When the quantity of input data increases, storing the enormous amount of training data not only consumes memory space but also increases training time. Handling data that have concept drifts in the traditional way usually mixes all kinds of concept drifts data to select training data, but uses these training data to build model may not be suitable. This research develops an effective and efficient method for selecting useful information from data stream according to data blocks that have common type of concept drift. The experiments on data generated according to both STAGGER and moving hyperplanes show that the proposed method produces better results. 陳耀輝 學位論文 ; thesis 0 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立嘉義大學 === 資訊工程學系研究所 === 100 === Data mining frequently uses machine learning methods to process data, and these methods need to learn from training data so that they can make predictions on new data. Traditional data mining research on streamed data usually assumes that the data distribution is stable. In the real world, however, concept drifts may occur in the continuously incoming data over time. When the quantity of input data increases, storing the enormous amount of training data not only consumes memory space but also increases training time. Handling data that have concept drifts in the traditional way usually mixes all kinds of concept drifts data to select training data, but uses these training data to build model may not be suitable. This research develops an effective and efficient method for selecting useful information from data stream according to data blocks that have common type of concept drift. The experiments on data generated according to both STAGGER and moving hyperplanes show that the proposed method produces better results.
|
author2 |
陳耀輝 |
author_facet |
陳耀輝 蘇郁喬 |
author |
蘇郁喬 |
spellingShingle |
蘇郁喬 Mining Streaming Data with Concept Drifts |
author_sort |
蘇郁喬 |
title |
Mining Streaming Data with Concept Drifts |
title_short |
Mining Streaming Data with Concept Drifts |
title_full |
Mining Streaming Data with Concept Drifts |
title_fullStr |
Mining Streaming Data with Concept Drifts |
title_full_unstemmed |
Mining Streaming Data with Concept Drifts |
title_sort |
mining streaming data with concept drifts |
url |
http://ndltd.ncl.edu.tw/handle/64960926653396805165 |
work_keys_str_mv |
AT sūyùqiáo miningstreamingdatawithconceptdrifts AT sūyùqiáo tànkānguānniànpiàoyídechuànliúzīliào |
_version_ |
1718090111875284992 |