FORF-S: A Novel Classification Technique for Class Imbalance Problem

In recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller...

Full description

Bibliographic Details
Main Authors: Yulin Jian, Mao Ye, Yan Min, Liang Tian, Guangjun Wang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9272759/
id doaj-a0c90cd6a85e4808a328b26d7dbe8593
record_format Article
spelling doaj-a0c90cd6a85e4808a328b26d7dbe85932021-03-30T03:28:36ZengIEEEIEEE Access2169-35362020-01-01821872021872810.1109/ACCESS.2020.30409789272759FORF-S: A Novel Classification Technique for Class Imbalance ProblemYulin Jian0Mao Ye1https://orcid.org/0000-0003-4760-8702Yan Min2Liang Tian3Guangjun Wang4School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaIn recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller instances than the others, also referred as majority class(es). To address this problem, a novel classification method called focused online random forest based on synthetic minority oversampling technique (FORF-SMOTE) is proposed in this paper and simply expressed as FORF-S, which constructs two online random forests respectively trained by original training dataset and new generated dataset, then further jointly constitute the model. Instead of oversampling the minority instances in data level, the algorithm in this paper is motivated by making the sampling strategies integrated into algorithm level to create classifiers, which can better identify the minority class. Moreover, the method is also compared with other state-of-the-art methods and the results have demonstrated that the proposed algorithm takes advantages of the aforementioned methods.https://ieeexplore.ieee.org/document/9272759/Imbalanced dataonline random forestsynthetic minority oversampling techniquenearest neighbor
collection DOAJ
language English
format Article
sources DOAJ
author Yulin Jian
Mao Ye
Yan Min
Liang Tian
Guangjun Wang
spellingShingle Yulin Jian
Mao Ye
Yan Min
Liang Tian
Guangjun Wang
FORF-S: A Novel Classification Technique for Class Imbalance Problem
IEEE Access
Imbalanced data
online random forest
synthetic minority oversampling technique
nearest neighbor
author_facet Yulin Jian
Mao Ye
Yan Min
Liang Tian
Guangjun Wang
author_sort Yulin Jian
title FORF-S: A Novel Classification Technique for Class Imbalance Problem
title_short FORF-S: A Novel Classification Technique for Class Imbalance Problem
title_full FORF-S: A Novel Classification Technique for Class Imbalance Problem
title_fullStr FORF-S: A Novel Classification Technique for Class Imbalance Problem
title_full_unstemmed FORF-S: A Novel Classification Technique for Class Imbalance Problem
title_sort forf-s: a novel classification technique for class imbalance problem
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller instances than the others, also referred as majority class(es). To address this problem, a novel classification method called focused online random forest based on synthetic minority oversampling technique (FORF-SMOTE) is proposed in this paper and simply expressed as FORF-S, which constructs two online random forests respectively trained by original training dataset and new generated dataset, then further jointly constitute the model. Instead of oversampling the minority instances in data level, the algorithm in this paper is motivated by making the sampling strategies integrated into algorithm level to create classifiers, which can better identify the minority class. Moreover, the method is also compared with other state-of-the-art methods and the results have demonstrated that the proposed algorithm takes advantages of the aforementioned methods.
topic Imbalanced data
online random forest
synthetic minority oversampling technique
nearest neighbor
url https://ieeexplore.ieee.org/document/9272759/
work_keys_str_mv AT yulinjian forfsanovelclassificationtechniqueforclassimbalanceproblem
AT maoye forfsanovelclassificationtechniqueforclassimbalanceproblem
AT yanmin forfsanovelclassificationtechniqueforclassimbalanceproblem
AT liangtian forfsanovelclassificationtechniqueforclassimbalanceproblem
AT guangjunwang forfsanovelclassificationtechniqueforclassimbalanceproblem
_version_ 1724183441437622272