Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model....
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2015-01-01
|
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2015/874032 |
id |
doaj-8f65e87f10084599bfdac01af8d98af1 |
---|---|
record_format |
Article |
spelling |
doaj-8f65e87f10084599bfdac01af8d98af12020-11-24T22:06:33ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472015-01-01201510.1155/2015/874032874032Adaptive Ensemble with Human Memorizing Characteristics for Data Stream MiningYanhuang Jiang0Qiangli Zhao1Yutong Lu2State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaState Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaState Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaCombining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a “knowledge repository.” When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model.http://dx.doi.org/10.1155/2015/874032 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yanhuang Jiang Qiangli Zhao Yutong Lu |
spellingShingle |
Yanhuang Jiang Qiangli Zhao Yutong Lu Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining Mathematical Problems in Engineering |
author_facet |
Yanhuang Jiang Qiangli Zhao Yutong Lu |
author_sort |
Yanhuang Jiang |
title |
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining |
title_short |
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining |
title_full |
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining |
title_fullStr |
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining |
title_full_unstemmed |
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining |
title_sort |
adaptive ensemble with human memorizing characteristics for data stream mining |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2015-01-01 |
description |
Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a “knowledge repository.” When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model. |
url |
http://dx.doi.org/10.1155/2015/874032 |
work_keys_str_mv |
AT yanhuangjiang adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining AT qianglizhao adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining AT yutonglu adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining |
_version_ |
1725823145790668800 |