An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification

The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straigh...

Full description

Bibliographic Details
Main Authors:	Seng Zian, Sameem Abdul Kareem, Kasturi Dewi Varathan
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Class imbalance H-measure imbalanced classification meta-learner stacked ensemble stacking
Online Access:	https://ieeexplore.ieee.org/document/9452051/

id	doaj-0af07fa3e4a74fed9d8cbc111c5d7ce8
record_format	Article
spelling	doaj-0af07fa3e4a74fed9d8cbc111c5d7ce82021-06-21T23:00:56ZengIEEEIEEE Access2169-35362021-01-019874348745210.1109/ACCESS.2021.30884149452051An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced ClassificationSeng Zian0https://orcid.org/0000-0001-8048-5842Sameem Abdul Kareem1Kasturi Dewi Varathan2https://orcid.org/0000-0003-3421-4501Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Artificial Intelligence, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaDepartment of Information Systems, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, MalaysiaThe selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification.https://ieeexplore.ieee.org/document/9452051/Class imbalanceH-measureimbalanced classificationmeta-learnerstacked ensemblestacking
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan
spellingShingle	Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification IEEE Access Class imbalance H-measure imbalanced classification meta-learner stacked ensemble stacking
author_facet	Seng Zian Sameem Abdul Kareem Kasturi Dewi Varathan
author_sort	Seng Zian
title	An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_short	An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_full	An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_fullStr	An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_full_unstemmed	An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification
title_sort	empirical evaluation of stacked ensembles with different meta-learners in imbalanced classification
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	The selection of a meta-learner determines the success of a stacked ensemble as the meta-learner is responsible for the final predictions of the stacked ensemble. Unfortunately, in imbalanced classification, selecting an appropriate and well-performing meta-learner of stacked ensemble is not straightforward as different meta-learners are advocated by different researchers. To investigate and identify a well-performing type of meta-learner in stacked ensemble for imbalanced classification, an experiment consisting of 19 meta-learners was conducted, detailed in this paper. Among the 19 meta-learners of stacked ensembles, a new weighted combination-based meta-learner that maximizes the H-measure during the training of stacked ensemble was first introduced and implemented in the empirical evaluation of this paper. The classification performances of stacked ensembles with 19 different meta-learners were recorded using both the area under the receiver operating characteristic curve (AUC) and H-measure (a metric that overcomes the deficiencies of the AUC). The weighted combination-based meta-learners of stacked ensembles have better classification performances on imbalanced datasets when compared to bagging-based, boosting-based, Decision Trees, Support Vector Machines, Naive Bayes, and Feedforward Neural Network meta-learners. Thus, the adoption of weighted combination-based meta-learners in stacked ensembles is recommended for their better performance on imbalanced datasets. Also, based on the empirical results, we identified better-performing meta-learners (such as the AUC maximizing meta-learner and the H-measure maximizing meta-learner) than the widely adopted meta-learner – Logistic Regression – in imbalanced classification.
topic	Class imbalance H-measure imbalanced classification meta-learner stacked ensemble stacking
url	https://ieeexplore.ieee.org/document/9452051/
work_keys_str_mv	AT sengzian anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sameemabdulkareem anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT kasturidewivarathan anempiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sengzian empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT sameemabdulkareem empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification AT kasturidewivarathan empiricalevaluationofstackedensembleswithdifferentmetalearnersinimbalancedclassification
_version_	1721363986017943552

An Empirical Evaluation of Stacked Ensembles With Different Meta-Learners in Imbalanced Classification

Similar Items