Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring
Credit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-09-01
|
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/22/9/989 |
id |
doaj-61b541106c41463f9fffd21067fc67e6 |
---|---|
record_format |
Article |
spelling |
doaj-61b541106c41463f9fffd21067fc67e62020-11-25T03:16:39ZengMDPI AGEntropy1099-43002020-09-012298998910.3390/e22090989Hybrid Harmony Search–Artificial Intelligence Models in Credit ScoringRui Ying Goh0Lai Soon Lee1Hsin-Vonn Seow2Kathiresan Gopal3Laboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaLaboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaNottingham University Business School, University of Nottingham Malaysia, Semenyih, 43500 Selangor, MalaysiaLaboratory of Computational Statistics and Operations Research, Institute for Mathematical Research, Universiti Putra Malaysia, Serdang, 43400 Selangor, MalaysiaCredit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to account<br />for various data patterns. Both are black-box models which are sensitive to hyperparameter settings.<br />Feature selection can be performed on SVM to enable explanation with the reduced features,<br />whereas feature importance computed by RF can be used for model explanation. The benefits<br />of accuracy and interpretation allow for significant improvement in the area of credit risk and<br />credit scoring. This paper proposes the use of Harmony Search (HS), to form a hybrid HS-SVM to<br />perform feature selection and hyperparameter tuning simultaneously, and a hybrid HS-RF to tune<br />the hyperparameters. A Modified HS (MHS) is also proposed with the main objective to achieve<br />comparable results as the standard HS with a shorter computational time. MHS consists of four<br />main modifications in the standard HS: (i) Elitism selection during memory consideration instead<br />of random selection, (ii) dynamic exploration and exploitation operators in place of the original<br />static operators, (iii) a self-adjusted bandwidth operator, and (iv) inclusion of additional termination<br />criteria to reach faster convergence. Along with parallel computing, MHS effectively reduces the<br />computational time of the proposed hybrid models. The proposed hybrid models are compared<br />with standard statistical models across three different datasets commonly used in credit scoring<br />studies. The computational results show that MHS-RF is most robust in terms of model performance,<br />model explainability and computational time.https://www.mdpi.com/1099-4300/22/9/989credit scoringsupport vector machinesrandom forestharmony searchfeature selectionhyperparameter tuning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rui Ying Goh Lai Soon Lee Hsin-Vonn Seow Kathiresan Gopal |
spellingShingle |
Rui Ying Goh Lai Soon Lee Hsin-Vonn Seow Kathiresan Gopal Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring Entropy credit scoring support vector machines random forest harmony search feature selection hyperparameter tuning |
author_facet |
Rui Ying Goh Lai Soon Lee Hsin-Vonn Seow Kathiresan Gopal |
author_sort |
Rui Ying Goh |
title |
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring |
title_short |
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring |
title_full |
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring |
title_fullStr |
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring |
title_full_unstemmed |
Hybrid Harmony Search–Artificial Intelligence Models in Credit Scoring |
title_sort |
hybrid harmony search–artificial intelligence models in credit scoring |
publisher |
MDPI AG |
series |
Entropy |
issn |
1099-4300 |
publishDate |
2020-09-01 |
description |
Credit scoring is an important tool used by financial institutions to correctly identify<br />defaulters and non-defaulters. Support Vector Machines (SVM) and Random Forest (RF) are the<br />Artificial Intelligence techniques that have been attracting interest due to their flexibility to account<br />for various data patterns. Both are black-box models which are sensitive to hyperparameter settings.<br />Feature selection can be performed on SVM to enable explanation with the reduced features,<br />whereas feature importance computed by RF can be used for model explanation. The benefits<br />of accuracy and interpretation allow for significant improvement in the area of credit risk and<br />credit scoring. This paper proposes the use of Harmony Search (HS), to form a hybrid HS-SVM to<br />perform feature selection and hyperparameter tuning simultaneously, and a hybrid HS-RF to tune<br />the hyperparameters. A Modified HS (MHS) is also proposed with the main objective to achieve<br />comparable results as the standard HS with a shorter computational time. MHS consists of four<br />main modifications in the standard HS: (i) Elitism selection during memory consideration instead<br />of random selection, (ii) dynamic exploration and exploitation operators in place of the original<br />static operators, (iii) a self-adjusted bandwidth operator, and (iv) inclusion of additional termination<br />criteria to reach faster convergence. Along with parallel computing, MHS effectively reduces the<br />computational time of the proposed hybrid models. The proposed hybrid models are compared<br />with standard statistical models across three different datasets commonly used in credit scoring<br />studies. The computational results show that MHS-RF is most robust in terms of model performance,<br />model explainability and computational time. |
topic |
credit scoring support vector machines random forest harmony search feature selection hyperparameter tuning |
url |
https://www.mdpi.com/1099-4300/22/9/989 |
work_keys_str_mv |
AT ruiyinggoh hybridharmonysearchartificialintelligencemodelsincreditscoring AT laisoonlee hybridharmonysearchartificialintelligencemodelsincreditscoring AT hsinvonnseow hybridharmonysearchartificialintelligencemodelsincreditscoring AT kathiresangopal hybridharmonysearchartificialintelligencemodelsincreditscoring |
_version_ |
1724634892624461824 |