An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straigh...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9452051/ |
id |
doaj-0af07fa3e4a74fed9d8cbc111c5d7ce8 |
---|---|
record_format |
Article |
spelling |
doaj-0af07fa3e4a74fed9d8cbc111c5d7ce82021-06-21T23:00:56ZengIEEEIEEE Access2169-35362021-01-019874348745210.1109/ACCESS.2021.30884149452051An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced ClassificationSeng Zian0https://orcid.org/0000-0001-8048-5842Sameem Abdul Kareem1Kasturi Dewi Varathan2https://orcid.org/0000-0003-3421-4501Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Information Systems, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaThe selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification.https://ieeexplore.ieee.org/document/9452051/Class imbalanceH-measureimbalanced classificationmeta-learnerstacked ensemblestacking |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan |
spellingShingle |
Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification IEEE Access Class imbalance H-measure imbalanced classification meta-learner stacked ensemble stacking |
author_facet |
Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan |
author_sort |
Seng Zian |
title |
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification |
title_short |
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification |
title_full |
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification |
title_fullStr |
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification |
title_full_unstemmed |
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification |
title_sort |
empirical evaluation of stacked ensembles with different meta-learners in imbalanced classification |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification. |
topic |
Class imbalance H-measure imbalanced classification meta-learner stacked ensemble stacking |
url |
https://ieeexplore.ieee.org/document/9452051/ |
work_keys_str_mv |
AT sengzian anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sameemabdulkareem anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT kasturidewivarathan anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sengzian empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sameemabdulkareem empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT kasturidewivarathan empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification |
_version_ |
1721363986017943552 |