Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring

Credit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to...

Full description

Bibliographic Details
Main Authors: Rui Ying Goh, Lai Soon Lee, Hsin-Vonn Seow, Kathiresan Gopal
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Entropy
Subjects:
Online Access:https://www.mdpi.com/1099-4300/22/9/989
id doaj-61b541106c41463f9fffd21067fc67e6
record_format Article
spelling doaj-61b541106c41463f9fffd21067fc67e62020-11-25T03:16:39ZengMDPI AGEntropy1099-43002020-09-012298998910.3390/e22090989Hybrid Harmony Search–Artificial Intelligence Models in Credit ScoringRui Ying Goh0Lai Soon Lee1Hsin-Vonn Seow2Kathiresan Gopal3Laboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaLaboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaNottingham University Business School, University of Nottingham Malaysia, Semenyih, 43500 Selangor, MalaysiaLaboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaCredit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to account<br />for various data patterns. Both are black-box models which are sensitive to hyperparameter settings.<br />Feature selection can be performed on SVM to enable explanation with the reduced features,<br />whereas feature importance computed by RF can be used for model explanation. The benefits<br />of accuracy and interpretation allow for significant improvement in the area of credit risk and<br />credit scoring. This paper proposes the use of Harmony Search (HS), to form a hybrid HS-SVM to<br />perform feature selection and hyperparameter tuning simultaneously, and a hybrid HS-RF to tune<br />the hyperparameters. A Modified HS (MHS) is also proposed with the main objective to achieve<br />comparable results as the standard HS with a shorter computational time. MHS consists of four<br />main modifications in the standard HS: (i) Elitism selection during memory consideration instead<br />of random selection, (ii) dynamic exploration and exploitation operators in place of the original<br />static operators, (iii) a self-adjusted bandwidth operator, and (iv) inclusion of additional termination<br />criteria to reach faster convergence. Along with parallel computing, MHS effectively reduces the<br />computational time of the proposed hybrid models. The proposed hybrid models are compared<br />with standard statistical models across three different datasets commonly used in credit scoring<br />studies. The computational results show that MHS-RF is most robust in terms of model performance,<br />model explainability and computational time.https://www.mdpi.com/1099-4300/22/9/989credit scoringsupport vector machinesrandom forestharmony searchfeature selectionhyperparameter tuning
collection DOAJ
language English
format Article
sources DOAJ
author Rui Ying Goh
Lai Soon Lee
Hsin-Vonn Seow
Kathiresan Gopal
spellingShingle Rui Ying Goh
Lai Soon Lee
Hsin-Vonn Seow
Kathiresan Gopal
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
Entropy
credit scoring
support vector machines
random forest
harmony search
feature selection
hyperparameter tuning
author_facet Rui Ying Goh
Lai Soon Lee
Hsin-Vonn Seow
Kathiresan Gopal
author_sort Rui Ying Goh
title Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
title_short Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
title_full Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
title_fullStr Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
title_full_unstemmed Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
title_sort hybrid harmony search–artificial intelligence models in credit scoring
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2020-09-01
description Credit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to account<br />for various data patterns. Both are black-box models which are sensitive to hyperparameter settings.<br />Feature selection can be performed on SVM to enable explanation with the reduced features,<br />whereas feature importance computed by RF can be used for model explanation. The benefits<br />of accuracy and interpretation allow for significant improvement in the area of credit risk and<br />credit scoring. This paper proposes the use of Harmony Search (HS), to form a hybrid HS-SVM to<br />perform feature selection and hyperparameter tuning simultaneously, and a hybrid HS-RF to tune<br />the hyperparameters. A Modified HS (MHS) is also proposed with the main objective to achieve<br />comparable results as the standard HS with a shorter computational time. MHS consists of four<br />main modifications in the standard HS: (i) Elitism selection during memory consideration instead<br />of random selection, (ii) dynamic exploration and exploitation operators in place of the original<br />static operators, (iii) a self-adjusted bandwidth operator, and (iv) inclusion of additional termination<br />criteria to reach faster convergence. Along with parallel computing, MHS effectively reduces the<br />computational time of the proposed hybrid models. The proposed hybrid models are compared<br />with standard statistical models across three different datasets commonly used in credit scoring<br />studies. The computational results show that MHS-RF is most robust in terms of model performance,<br />model explainability and computational time.
topic credit scoring
support vector machines
random forest
harmony search
feature selection
hyperparameter tuning
url https://www.mdpi.com/1099-4300/22/9/989
work_keys_str_mv AT ruiyinggoh hybridharmonysearchartificialintelligencemodelsincreditscoring
AT laisoonlee hybridharmonysearchartificialintelligencemodelsincreditscoring
AT hsinvonnseow hybridharmonysearchartificialintelligencemodelsincreditscoring
AT kathiresangopal hybridharmonysearchartificialintelligencemodelsincreditscoring
_version_ 1724634892624461824