Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment

Machine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client charac...

Full description

Bibliographic Details
Main Authors: Hang Zhang, Qingbao Liu
Format: Article
Language:English
Published: MDPI AG 2019-07-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/11/7/890
id doaj-61e299464a8944e995d3e8bc0827f5e6
record_format Article
spelling doaj-61e299464a8944e995d3e8bc0827f5e62020-11-25T01:52:32ZengMDPI AGSymmetry2073-89942019-07-0111789010.3390/sym11070890sym11070890Online Learning Method for Drift and Imbalance Problem in Client Credit AssessmentHang Zhang0Qingbao Liu1Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410073, ChinaMachine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client characteristics and credit assessment results may change over time, causing concept drift in client credit assessments. Moreover, client credit assessment data are naturally asymmetric and class imbalanced because of the screening of clients. Aiming at solving the joint research issue of concept drift and class imbalance in client credit assessments, in this paper, a novel sample-based online learning ensemble (SOLE) for client credit assessment is proposed. A novel multiple time scale ensemble classifier and a novel sample-based online class imbalance learning procedure are proposed to handle the potential concept drift and class imbalance in the client credit assessment data streams. The experiments are carried out on two real-world client credit assessment cases, which present a comprehensive comparison between the proposed SOLE and other state-of-the-art online learning algorithms. In addition, the base classifier preference and the computing resource consumption of all the comparative algorithms are tested. In general, SOLE achieves a better performance than other methods using fewer computing resources. In addition, the results of the credit scoring model and the Kolmogorov−Smirnov (KS) test also prove that SOLE has good practicality in actual client credit assessment applications.https://www.mdpi.com/2073-8994/11/7/890credit assessmentonline learningensemble learningsampling method
collection DOAJ
language English
format Article
sources DOAJ
author Hang Zhang
Qingbao Liu
spellingShingle Hang Zhang
Qingbao Liu
Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
Symmetry
credit assessment
online learning
ensemble learning
sampling method
author_facet Hang Zhang
Qingbao Liu
author_sort Hang Zhang
title Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
title_short Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
title_full Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
title_fullStr Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
title_full_unstemmed Online Learning Method for Drift and Imbalance Problem in Client Credit Assessment
title_sort online learning method for drift and imbalance problem in client credit assessment
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2019-07-01
description Machine learning algorithms have been widely used in the field of client credit assessment. However, few of the algorithms have focused on and solved the problems of concept drift and class imbalance. Due to changes in the macroeconomic environment and markets, the relationship between client characteristics and credit assessment results may change over time, causing concept drift in client credit assessments. Moreover, client credit assessment data are naturally asymmetric and class imbalanced because of the screening of clients. Aiming at solving the joint research issue of concept drift and class imbalance in client credit assessments, in this paper, a novel sample-based online learning ensemble (SOLE) for client credit assessment is proposed. A novel multiple time scale ensemble classifier and a novel sample-based online class imbalance learning procedure are proposed to handle the potential concept drift and class imbalance in the client credit assessment data streams. The experiments are carried out on two real-world client credit assessment cases, which present a comprehensive comparison between the proposed SOLE and other state-of-the-art online learning algorithms. In addition, the base classifier preference and the computing resource consumption of all the comparative algorithms are tested. In general, SOLE achieves a better performance than other methods using fewer computing resources. In addition, the results of the credit scoring model and the Kolmogorov−Smirnov (KS) test also prove that SOLE has good practicality in actual client credit assessment applications.
topic credit assessment
online learning
ensemble learning
sampling method
url https://www.mdpi.com/2073-8994/11/7/890
work_keys_str_mv AT hangzhang onlinelearningmethodfordriftandimbalanceprobleminclientcreditassessment
AT qingbaoliu onlinelearningmethodfordriftandimbalanceprobleminclientcreditassessment
_version_ 1724994641085857792