An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification

The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straigh...

Full description

Bibliographic Details
Main Authors: Seng Zian, Sameem Abdul Kareem, Kasturi Dewi Varathan
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9452051/
id doaj-0af07fa3e4a74fed9d8cbc111c5d7ce8
record_format Article
spelling doaj-0af07fa3e4a74fed9d8cbc111c5d7ce82021-06-21T23:00:56ZengIEEEIEEE Access2169-35362021-01-019874348745210.1109/ACCESS.2021.30884149452051An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced ClassificationSeng Zian0https://orcid.org/0000-0001-8048-5842Sameem Abdul Kareem1Kasturi Dewi Varathan2https://orcid.org/0000-0003-3421-4501Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Information Systems, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaThe selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification.https://ieeexplore.ieee.org/document/9452051/Class imbalanceH-measureimbalanced classificationmeta-learnerstacked ensemblestacking
collection DOAJ
language English
format Article
sources DOAJ
author Seng Zian
Sameem Abdul Kareem
Kasturi Dewi Varathan
spellingShingle Seng Zian
Sameem Abdul Kareem
Kasturi Dewi Varathan
An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
IEEE Access
Class imbalance
H-measure
imbalanced classification
meta-learner
stacked ensemble
stacking
author_facet Seng Zian
Sameem Abdul Kareem
Kasturi Dewi Varathan
author_sort Seng Zian
title An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_short An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_full An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_fullStr An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_full_unstemmed An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_sort empirical evaluation of stacked ensembles with different meta-learners in imbalanced classification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification.
topic Class imbalance
H-measure
imbalanced classification
meta-learner
stacked ensemble
stacking
url https://ieeexplore.ieee.org/document/9452051/
work_keys_str_mv AT sengzian anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
AT sameemabdulkareem anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
AT kasturidewivarathan anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
AT sengzian empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
AT sameemabdulkareem empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
AT kasturidewivarathan empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
_version_ 1721363986017943552