Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining

Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model....

Full description

Bibliographic Details
Main Authors: Yanhuang Jiang, Qiangli Zhao, Yutong Lu
Format: Article
Language:English
Published: Hindawi Limited 2015-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2015/874032
id doaj-8f65e87f10084599bfdac01af8d98af1
record_format Article
spelling doaj-8f65e87f10084599bfdac01af8d98af12020-11-24T22:06:33ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472015-01-01201510.1155/2015/874032874032Adaptive Ensemble with Human Memorizing Characteristics for Data Stream MiningYanhuang Jiang0Qiangli Zhao1Yutong Lu2State Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaState Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaState Key Laboratory of High Performance Computing, National University of Defense Technology, Changsha 410073, ChinaCombining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a “knowledge repository.” When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model.http://dx.doi.org/10.1155/2015/874032
collection DOAJ
language English
format Article
sources DOAJ
author Yanhuang Jiang
Qiangli Zhao
Yutong Lu
spellingShingle Yanhuang Jiang
Qiangli Zhao
Yutong Lu
Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
Mathematical Problems in Engineering
author_facet Yanhuang Jiang
Qiangli Zhao
Yutong Lu
author_sort Yanhuang Jiang
title Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
title_short Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
title_full Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
title_fullStr Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
title_full_unstemmed Adaptive Ensemble with Human Memorizing Characteristics for Data Stream Mining
title_sort adaptive ensemble with human memorizing characteristics for data stream mining
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2015-01-01
description Combining several classifiers on sequential chunks of training instances is a popular strategy for data stream mining with concept drifts. This paper introduces human recalling and forgetting mechanisms into a data stream mining system and proposes a Memorizing Based Data Stream Mining (MDSM) model. In this model, each component classifier is regarded as a piece of knowledge that a human obtains through learning some materials and has a memory retention value reflecting its usefulness in the history. The classifiers with high memory retention values are reserved in a “knowledge repository.” When a new data chunk comes, most useful classifiers will be selected (recalled) from the repository and compose the current target ensemble. Based on MDSM, we put forward a new algorithm, MAE (Memorizing Based Adaptive Ensemble), which uses Ebbinghaus forgetting curve as the forgetting mechanism and adopts ensemble pruning as the recalling mechanism. Compared with four popular data stream mining approaches on the datasets with different concept drifts, the experimental results show that MAE achieves high and stable predicting accuracy, especially for the applications with recurring or complex concept drifts. The results also prove the effectiveness of MDSM model.
url http://dx.doi.org/10.1155/2015/874032
work_keys_str_mv AT yanhuangjiang adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining
AT qianglizhao adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining
AT yutonglu adaptiveensemblewithhumanmemorizingcharacteristicsfordatastreammining
_version_ 1725823145790668800