Fast Reaction to Sudden Concept Drift in the Absence of Class Labels
A data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relati...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-01-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/10/2/606 |
id |
doaj-c291229acb6e44b4bc78923b301c20f2 |
---|---|
record_format |
Article |
spelling |
doaj-c291229acb6e44b4bc78923b301c20f22020-11-25T03:30:24ZengMDPI AGApplied Sciences2076-34172020-01-0110260610.3390/app10020606app10020606Fast Reaction to Sudden Concept Drift in the Absence of Class LabelsOsama A. Mahdi0Eric Pardede1Nawfal Ali2Jinli Cao3Computer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaComputer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaFaculty of Information Technology, Monash University, Clayton 3800, AustraliaComputer Science and Information Technology, La Trobe University, Bundoora VIC 3086, AustraliaA data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relation between the input data and the target variable changes over time. Most existing works make an optimistic assumption that all incoming data are labelled and the class labels are available immediately. However, such an assumption is not always valid. Therefore, a lack of class labels aggravates the problem of concept drift detection. With this motivation, we propose a drift detector that reacts naturally to sudden drifts in the absence of class labels. In a novel way, the proposed detector reacts to concept drift in the absence of class labels, where the true label of an example is not necessary. Instead of monitoring the error estimates, the proposed detector monitors the diversity of a pair of classifiers, where the true label of an example is not necessary to determine whether components disagree. Using several datasets, an experimental evaluation and comparison is conducted against several existing detectors. The experiment results show that the proposed detector can detect drifts with less delay, runtime and memory usage.https://www.mdpi.com/2076-3417/10/2/606concept driftdata stream miningsemisupervised environment |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Osama A. Mahdi Eric Pardede Nawfal Ali Jinli Cao |
spellingShingle |
Osama A. Mahdi Eric Pardede Nawfal Ali Jinli Cao Fast Reaction to Sudden Concept Drift in the Absence of Class Labels Applied Sciences concept drift data stream mining semisupervised environment |
author_facet |
Osama A. Mahdi Eric Pardede Nawfal Ali Jinli Cao |
author_sort |
Osama A. Mahdi |
title |
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels |
title_short |
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels |
title_full |
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels |
title_fullStr |
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels |
title_full_unstemmed |
Fast Reaction to Sudden Concept Drift in the Absence of Class Labels |
title_sort |
fast reaction to sudden concept drift in the absence of class labels |
publisher |
MDPI AG |
series |
Applied Sciences |
issn |
2076-3417 |
publishDate |
2020-01-01 |
description |
A data stream can be considered as a sequence of examples that arrive continuously and are potentially unbounded, such as web page visits, sensor readings and call records. One of the serious and challenging problems that appears in a data stream is concept drift. This problem occurs when the relation between the input data and the target variable changes over time. Most existing works make an optimistic assumption that all incoming data are labelled and the class labels are available immediately. However, such an assumption is not always valid. Therefore, a lack of class labels aggravates the problem of concept drift detection. With this motivation, we propose a drift detector that reacts naturally to sudden drifts in the absence of class labels. In a novel way, the proposed detector reacts to concept drift in the absence of class labels, where the true label of an example is not necessary. Instead of monitoring the error estimates, the proposed detector monitors the diversity of a pair of classifiers, where the true label of an example is not necessary to determine whether components disagree. Using several datasets, an experimental evaluation and comparison is conducted against several existing detectors. The experiment results show that the proposed detector can detect drifts with less delay, runtime and memory usage. |
topic |
concept drift data stream mining semisupervised environment |
url |
https://www.mdpi.com/2076-3417/10/2/606 |
work_keys_str_mv |
AT osamaamahdi fastreactiontosuddenconceptdriftintheabsenceofclasslabels AT ericpardede fastreactiontosuddenconceptdriftintheabsenceofclasslabels AT nawfalali fastreactiontosuddenconceptdriftintheabsenceofclasslabels AT jinlicao fastreactiontosuddenconceptdriftintheabsenceofclasslabels |
_version_ |
1724575800155439104 |