Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
碩士 === 國立交通大學 === 工業工程與管理學系 === 100 === The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/11786273799598686385 |
id |
ndltd-TW-100NCTU5031118 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCTU50311182016-03-28T04:20:38Z http://ndltd.ncl.edu.tw/handle/11786273799598686385 Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique 應用增生少數合成技術建構信用風險評估模型 Yi-Hsien Lin 林宜憲 碩士 國立交通大學 工業工程與管理學系 100 The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit risk. It is not uncommon to observe class imbalance problem in finance risk data. Class imbalance problem is asymmetric categories within data, that is, there is one class of data (major class) significantly outnumbered others (minor class). If we trained a model with imbalanced data, while the accuracy of major class instances might be very well, it could have a poor predictive ability to identify minority instances. Most of the risk assessment models apply sampling to deal with the class imbalanced problem. However, sampling method may lead to lack of data integrity and the model is sensitive on the sampling result as to produce inaccurate problems. This study constructs a risk model using Synthetic Minority Over-sampling Technique (SMOTE) to tackle class imbalance problems. The model we proposed not only fixed the lack of data integrity, but also solved the poor minority class predictive ability issue, hence improved the overall model accuracy. In the end, the study compares the results of classification with several sampling methods and previous Granular Computing model. By calculation and compare of the accuracy, AUC and G-means, we can conclude that using Synthetic Minority Over-sampling Technique to construct risk models would have the same or even better result than sampling and Granular Computing model. 張永佳 2012 學位論文 ; thesis 40 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 工業工程與管理學系 === 100 === The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit risk. It is not uncommon to observe class imbalance problem in finance risk data. Class imbalance problem is asymmetric categories within data, that is, there is one class of data (major class) significantly outnumbered others (minor class). If we trained a model with imbalanced data, while the accuracy of major class instances might be very well, it could have a poor predictive ability to identify minority instances. Most of the risk assessment models apply sampling to deal with the class imbalanced problem. However, sampling method may lead to lack of data integrity and the model is sensitive on the sampling result as to produce inaccurate problems. This study constructs a risk model using Synthetic Minority Over-sampling Technique (SMOTE) to tackle class imbalance problems. The model we proposed not only fixed the lack of data integrity, but also solved the poor minority class predictive ability issue, hence improved the overall model accuracy. In the end, the study compares the results of classification with several sampling methods and previous Granular Computing model. By calculation and compare of the accuracy, AUC and G-means, we can conclude that using Synthetic Minority Over-sampling Technique to construct risk models would have the same or even better result than sampling and Granular Computing model.
|
author2 |
張永佳 |
author_facet |
張永佳 Yi-Hsien Lin 林宜憲 |
author |
Yi-Hsien Lin 林宜憲 |
spellingShingle |
Yi-Hsien Lin 林宜憲 Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
author_sort |
Yi-Hsien Lin |
title |
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
title_short |
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
title_full |
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
title_fullStr |
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
title_full_unstemmed |
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique |
title_sort |
constructing a credit risk assessment model using synthetic minority over-sampling technique |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/11786273799598686385 |
work_keys_str_mv |
AT yihsienlin constructingacreditriskassessmentmodelusingsyntheticminorityoversamplingtechnique AT línyíxiàn constructingacreditriskassessmentmodelusingsyntheticminorityoversamplingtechnique AT yihsienlin yīngyòngzēngshēngshǎoshùhéchéngjìshùjiàngòuxìnyòngfēngxiǎnpínggūmóxíng AT línyíxiàn yīngyòngzēngshēngshǎoshùhéchéngjìshùjiàngòuxìnyòngfēngxiǎnpínggūmóxíng |
_version_ |
1718213303026581504 |