Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
Machine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client charac...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-07-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/11/7/890 |
id |
doaj-61e299464a8944e995d3e8bc0827f5e6 |
---|---|
record_format |
Article |
spelling |
doaj-61e299464a8944e995d3e8bc0827f5e62020-11-25T01:52:32ZengMDPI AGSymmetry2073-89942019-07-0111789010.3390/sym11070890sym11070890Online Learning Method for Drift and Imbalance Problem in Client Credit AssessmentHang Zhang0Qingbao Liu1Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaMachine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client characteristics and credit assessment results may change over time, causing concept drift in client credit assessments. Moreover, client credit assessment data are naturally asymmetric and class imbalanced because of the screening of clients. Aiming at solving the joint research issue of concept drift and class imbalance in client credit assessments, in this paper, a novel sample-based online learning ensemble (SOLE) for client credit assessment is proposed. A novel multiple time scale ensemble classifier and a novel sample-based online class imbalance learning procedure are proposed to handle the potential concept drift and class imbalance in the client credit assessment data streams. The experiments are carried out on two real-world client credit assessment cases, which present a comprehensive comparison between the proposed SOLE and other state-of-the-art online learning algorithms. In addition, the base classifier preference and the computing resource consumption of all the comparative algorithms are tested. In general, SOLE achieves a better performance than other methods using fewer computing resources. In addition, the results of the credit scoring model and the Kolmogorov−Smirnov (KS) test also prove that SOLE has good practicality in actual client credit assessment applications.https://www.mdpi.com/2073-8994/11/7/890credit assessmentonline learningensemble learningsampling method |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hang Zhang Qingbao Liu |
spellingShingle |
Hang Zhang Qingbao Liu Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment Symmetry credit assessment online learning ensemble learning sampling method |
author_facet |
Hang Zhang Qingbao Liu |
author_sort |
Hang Zhang |
title |
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment |
title_short |
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment |
title_full |
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment |
title_fullStr |
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment |
title_full_unstemmed |
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment |
title_sort |
online learning method for drift and imbalance problem in client credit assessment |
publisher |
MDPI AG |
series |
Symmetry |
issn |
2073-8994 |
publishDate |
2019-07-01 |
description |
Machine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client characteristics and credit assessment results may change over time, causing concept drift in client credit assessments. Moreover, client credit assessment data are naturally asymmetric and class imbalanced because of the screening of clients. Aiming at solving the joint research issue of concept drift and class imbalance in client credit assessments, in this paper, a novel sample-based online learning ensemble (SOLE) for client credit assessment is proposed. A novel multiple time scale ensemble classifier and a novel sample-based online class imbalance learning procedure are proposed to handle the potential concept drift and class imbalance in the client credit assessment data streams. The experiments are carried out on two real-world client credit assessment cases, which present a comprehensive comparison between the proposed SOLE and other state-of-the-art online learning algorithms. In addition, the base classifier preference and the computing resource consumption of all the comparative algorithms are tested. In general, SOLE achieves a better performance than other methods using fewer computing resources. In addition, the results of the credit scoring model and the Kolmogorov−Smirnov (KS) test also prove that SOLE has good practicality in actual client credit assessment applications. |
topic |
credit assessment online learning ensemble learning sampling method |
url |
https://www.mdpi.com/2073-8994/11/7/890 |
work_keys_str_mv |
AT hangzhang onlinelearningmethodfordriftandimbalanceprobleminclientcreditassessment AT qingbaoliu onlinelearningmethodfordriftandimbalanceprobleminclientcreditassessment |
_version_ |
1724994641085857792 |