FORF-S: A Novel Classification Technique for Class Imbalance Problem
In recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9272759/ |
id |
doaj-a0c90cd6a85e4808a328b26d7dbe8593 |
---|---|
record_format |
Article |
spelling |
doaj-a0c90cd6a85e4808a328b26d7dbe85932021-03-30T03:28:36ZengIEEEIEEE Access2169-35362020-01-01821872021872810.1109/ACCESS.2020.30409789272759FORF-S: A Novel Classification Technique for Class Imbalance ProblemYulin Jian0Mao Ye1https://orcid.org/0000-0003-4760-8702Yan Min2Liang Tian3Guangjun Wang4School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaIn recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller instances than the others, also referred as majority class(es). To address this problem, a novel classification method called focused online random forest based on synthetic minority oversampling technique (FORF-SMOTE) is proposed in this paper and simply expressed as FORF-S, which constructs two online random forests respectively trained by original training dataset and new generated dataset, then further jointly constitute the model. Instead of oversampling the minority instances in data level, the algorithm in this paper is motivated by making the sampling strategies integrated into algorithm level to create classifiers, which can better identify the minority class. Moreover, the method is also compared with other state-of-the-art methods and the results have demonstrated that the proposed algorithm takes advantages of the aforementioned methods.https://ieeexplore.ieee.org/document/9272759/Imbalanced dataonline random forestsynthetic minority oversampling techniquenearest neighbor |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yulin Jian Mao Ye Yan Min Liang Tian Guangjun Wang |
spellingShingle |
Yulin Jian Mao Ye Yan Min Liang Tian Guangjun Wang FORF-S: A Novel Classification Technique for Class Imbalance Problem IEEE Access Imbalanced data online random forest synthetic minority oversampling technique nearest neighbor |
author_facet |
Yulin Jian Mao Ye Yan Min Liang Tian Guangjun Wang |
author_sort |
Yulin Jian |
title |
FORF-S: A Novel Classification Technique for Class Imbalance Problem |
title_short |
FORF-S: A Novel Classification Technique for Class Imbalance Problem |
title_full |
FORF-S: A Novel Classification Technique for Class Imbalance Problem |
title_fullStr |
FORF-S: A Novel Classification Technique for Class Imbalance Problem |
title_full_unstemmed |
FORF-S: A Novel Classification Technique for Class Imbalance Problem |
title_sort |
forf-s: a novel classification technique for class imbalance problem |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In recent years, the class imbalance problem that aims to correctly classify imbalanced data sets and improve the classification performance of minority instances has received attention. Such problem can be roughly described as one of the class(es) termed as minority class(es) contains much smaller instances than the others, also referred as majority class(es). To address this problem, a novel classification method called focused online random forest based on synthetic minority oversampling technique (FORF-SMOTE) is proposed in this paper and simply expressed as FORF-S, which constructs two online random forests respectively trained by original training dataset and new generated dataset, then further jointly constitute the model. Instead of oversampling the minority instances in data level, the algorithm in this paper is motivated by making the sampling strategies integrated into algorithm level to create classifiers, which can better identify the minority class. Moreover, the method is also compared with other state-of-the-art methods and the results have demonstrated that the proposed algorithm takes advantages of the aforementioned methods. |
topic |
Imbalanced data online random forest synthetic minority oversampling technique nearest neighbor |
url |
https://ieeexplore.ieee.org/document/9272759/ |
work_keys_str_mv |
AT yulinjian forfsanovelclassificationtechniqueforclassimbalanceproblem AT maoye forfsanovelclassificationtechniqueforclassimbalanceproblem AT yanmin forfsanovelclassificationtechniqueforclassimbalanceproblem AT liangtian forfsanovelclassificationtechniqueforclassimbalanceproblem AT guangjunwang forfsanovelclassificationtechniqueforclassimbalanceproblem |
_version_ |
1724183441437622272 |