Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms
One of the important research problems in the context of financial institutions is the assessment of credit risk and the decision to whether grant or refuse a loan. Recently, machine learning based methods are increasingly employed to solve such problems. However, the selection of appropriate featur...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-08-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/10/17/2099 |
id |
doaj-643460a89ac445a1ba894bb821e33d6d |
---|---|
record_format |
Article |
spelling |
doaj-643460a89ac445a1ba894bb821e33d6d2021-09-09T13:42:07ZengMDPI AGElectronics2079-92922021-08-01102099209910.3390/electronics10172099Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning AlgorithmsPaweł Ziemba0Jarosław Becker1Aneta Becker2Aleksandra Radomska-Zalas3Mateusz Pawluk4Dariusz Wierzba5Institute of Management, University of Szczecin, Aleja Papieża Jana Pawła II 22A, 70-453 Szczecin, PolandFaculty of Technology, The Jacob of Paradies University, Chopina 52, 66-400 Gorzów Wielkopolski, PolandFaculty of Economics, West Pomeranian University of Technology, Janickiego 31, 71-210 Szczecin, PolandFaculty of Technology, The Jacob of Paradies University, Chopina 52, 66-400 Gorzów Wielkopolski, PolandFaculty of Mathematics and Information Science, Informatics, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, PolandFaculty of Economic Sciences, University of Warsaw, Długa 44/50, 00-241 Warsaw, PolandOne of the important research problems in the context of financial institutions is the assessment of credit risk and the decision to whether grant or refuse a loan. Recently, machine learning based methods are increasingly employed to solve such problems. However, the selection of appropriate feature selection technique, sampling mechanism, and/or classifiers for credit decision support is very challenging, and can affect the quality of the loan recommendations. To address this challenging task, this article examines the effectiveness of various data science techniques in issue of credit decision support. In particular, processing pipeline was designed, which consists of methods for data resampling, feature discretization, feature selection, and binary classification. We suggest building appropriate decision models leveraging pertinent methods for binary classification, feature selection, as well as data resampling and feature discretization. The selected models’ feasibility analysis was performed through rigorous experiments on real data describing the client’s ability for loan repayment. During experiments, we analyzed the impact of feature selection on the results of binary classification, and the impact of data resampling with feature discretization on the results of feature selection and binary classification. After experimental evaluation, we found that correlation-based feature selection technique and random forest classifier yield the superior performance in solving underlying problem.https://www.mdpi.com/2079-9292/10/17/2099credit scoringcash loansmachine learningdecision modelclassificationfeature selection |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Paweł Ziemba Jarosław Becker Aneta Becker Aleksandra Radomska-Zalas Mateusz Pawluk Dariusz Wierzba |
spellingShingle |
Paweł Ziemba Jarosław Becker Aneta Becker Aleksandra Radomska-Zalas Mateusz Pawluk Dariusz Wierzba Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms Electronics credit scoring cash loans machine learning decision model classification feature selection |
author_facet |
Paweł Ziemba Jarosław Becker Aneta Becker Aleksandra Radomska-Zalas Mateusz Pawluk Dariusz Wierzba |
author_sort |
Paweł Ziemba |
title |
Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms |
title_short |
Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms |
title_full |
Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms |
title_fullStr |
Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms |
title_full_unstemmed |
Credit Decision Support Based on Real Set of Cash Loans Using Integrated Machine Learning Algorithms |
title_sort |
credit decision support based on real set of cash loans using integrated machine learning algorithms |
publisher |
MDPI AG |
series |
Electronics |
issn |
2079-9292 |
publishDate |
2021-08-01 |
description |
One of the important research problems in the context of financial institutions is the assessment of credit risk and the decision to whether grant or refuse a loan. Recently, machine learning based methods are increasingly employed to solve such problems. However, the selection of appropriate feature selection technique, sampling mechanism, and/or classifiers for credit decision support is very challenging, and can affect the quality of the loan recommendations. To address this challenging task, this article examines the effectiveness of various data science techniques in issue of credit decision support. In particular, processing pipeline was designed, which consists of methods for data resampling, feature discretization, feature selection, and binary classification. We suggest building appropriate decision models leveraging pertinent methods for binary classification, feature selection, as well as data resampling and feature discretization. The selected models’ feasibility analysis was performed through rigorous experiments on real data describing the client’s ability for loan repayment. During experiments, we analyzed the impact of feature selection on the results of binary classification, and the impact of data resampling with feature discretization on the results of feature selection and binary classification. After experimental evaluation, we found that correlation-based feature selection technique and random forest classifier yield the superior performance in solving underlying problem. |
topic |
credit scoring cash loans machine learning decision model classification feature selection |
url |
https://www.mdpi.com/2079-9292/10/17/2099 |
work_keys_str_mv |
AT pawełziemba creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms AT jarosławbecker creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms AT anetabecker creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms AT aleksandraradomskazalas creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms AT mateuszpawluk creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms AT dariuszwierzba creditdecisionsupportbasedonrealsetofcashloansusingintegratedmachinelearningalgorithms |
_version_ |
1717760626647891968 |