Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology

The most pervasive segment of techniques in managing class imbalance in machine learning are re-sampling-based methods. The emergence of deep generative models for augmenting the size of the under-represented class, prompts one to review the question of the suitability of the model chosen for data a...

Full description

Bibliographic Details
Main Authors:	Behroz Mirza, Danish Haroon, Behraj Khan, Ali Padhani, Tahir Q. Syed
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Adversarial networks anomaly detection class imbalance deep generative models density estimation generative variational auto encoders
Online Access:	https://ieeexplore.ieee.org/document/9395632/

id	doaj-4a9d07c51a75479aac627072136355ef
record_format	Article
spelling	doaj-4a9d07c51a75479aac627072136355ef2021-04-14T23:00:21ZengIEEEIEEE Access2169-35362021-01-019558795589710.1109/ACCESS.2021.30713899395632Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration MethodologyBehroz Mirza0Danish Haroon1Behraj Khan2https://orcid.org/0000-0003-0985-9543Ali Padhani3Tahir Q. Syed4https://orcid.org/0000-0003-0638-9689School of Computing, National University of Computer and Emerging Science, Karachi, PakistanSchool of Computing, National University of Computer and Emerging Science, Karachi, PakistanSchool of Computing, National University of Computer and Emerging Science, Karachi, PakistanSchool of Computing, National University of Computer and Emerging Science, Karachi, PakistanInstitute of Business Administration, Karachi, PakistanThe most pervasive segment of techniques in managing class imbalance in machine learning are re-sampling-based methods. The emergence of deep generative models for augmenting the size of the under-represented class, prompts one to review the question of the suitability of the model chosen for data augmentation with the metric selected for the-goodness-of classification. This work defines this suitability by using newly-sampled data points from each generative model first to the degree of parity, and studying classification performance on a large set of metrics. We extend the investigation to different proportions of augmented data points for identifying the sensitivity of the metric to the degree of imbalance, leading to the discovery of an optimum proportion against the metric. The models used are GAN, VAE and RBM and the metrics include Precision, Recall, F1-Score, AUC, G-Mean and Balanced Accuracy. We offer a comparison of these models with the established class of data synthesizing counterparts on the aforementioned metrics. Deep generative models outperform the state-of-the-art on 5 metrics on multiple datasets and also comprehensively surpass the baselines. This work thereby recommends the following model-metric mappings: VAE for high Precision and F1-Score, RBM for high Recall and GAN for high AUC, G-Mean and Balanced Accuracy under various recommended proportions of the minority class.https://ieeexplore.ieee.org/document/9395632/Adversarial networksanomaly detectionclass imbalancedeep generative modelsdensity estimationgenerative variational auto encoders
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Behroz Mirza Danish Haroon Behraj Khan Ali Padhani Tahir Q. Syed
spellingShingle	Behroz Mirza Danish Haroon Behraj Khan Ali Padhani Tahir Q. Syed Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology IEEE Access Adversarial networks anomaly detection class imbalance deep generative models density estimation generative variational auto encoders
author_facet	Behroz Mirza Danish Haroon Behraj Khan Ali Padhani Tahir Q. Syed
author_sort	Behroz Mirza
title	Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology
title_short	Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology
title_full	Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology
title_fullStr	Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology
title_full_unstemmed	Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology
title_sort	deep generative models to counter class imbalance: a model-metric mapping with proportion calibration methodology
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	The most pervasive segment of techniques in managing class imbalance in machine learning are re-sampling-based methods. The emergence of deep generative models for augmenting the size of the under-represented class, prompts one to review the question of the suitability of the model chosen for data augmentation with the metric selected for the-goodness-of classification. This work defines this suitability by using newly-sampled data points from each generative model first to the degree of parity, and studying classification performance on a large set of metrics. We extend the investigation to different proportions of augmented data points for identifying the sensitivity of the metric to the degree of imbalance, leading to the discovery of an optimum proportion against the metric. The models used are GAN, VAE and RBM and the metrics include Precision, Recall, F1-Score, AUC, G-Mean and Balanced Accuracy. We offer a comparison of these models with the established class of data synthesizing counterparts on the aforementioned metrics. Deep generative models outperform the state-of-the-art on 5 metrics on multiple datasets and also comprehensively surpass the baselines. This work thereby recommends the following model-metric mappings: VAE for high Precision and F1-Score, RBM for high Recall and GAN for high AUC, G-Mean and Balanced Accuracy under various recommended proportions of the minority class.
topic	Adversarial networks anomaly detection class imbalance deep generative models density estimation generative variational auto encoders
url	https://ieeexplore.ieee.org/document/9395632/
work_keys_str_mv	AT behrozmirza deepgenerativemodelstocounterclassimbalanceamodelmetricmappingwithproportioncalibrationmethodology AT danishharoon deepgenerativemodelstocounterclassimbalanceamodelmetricmappingwithproportioncalibrationmethodology AT behrajkhan deepgenerativemodelstocounterclassimbalanceamodelmetricmappingwithproportioncalibrationmethodology AT alipadhani deepgenerativemodelstocounterclassimbalanceamodelmetricmappingwithproportioncalibrationmethodology AT tahirqsyed deepgenerativemodelstocounterclassimbalanceamodelmetricmappingwithproportioncalibrationmethodology
_version_	1721526905801277440

Deep Generative Models to Counter Class Imbalance: A Model-Metric Mapping With Proportion Calibration Methodology

Similar Items