Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique

碩士 === 國立交通大學 === 工業工程與管理學系 === 100 === The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit...

Full description

Bibliographic Details
Main Authors: Yi-Hsien Lin, 林宜憲
Other Authors: 張永佳
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/11786273799598686385
id ndltd-TW-100NCTU5031118
record_format oai_dc
spelling ndltd-TW-100NCTU50311182016-03-28T04:20:38Z http://ndltd.ncl.edu.tw/handle/11786273799598686385 Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique 應用增生少數合成技術建構信用風險評估模型 Yi-Hsien Lin 林宜憲 碩士 國立交通大學 工業工程與管理學系 100 The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit risk. It is not uncommon to observe class imbalance problem in finance risk data. Class imbalance problem is asymmetric categories within data, that is, there is one class of data (major class) significantly outnumbered others (minor class). If we trained a model with imbalanced data, while the accuracy of major class instances might be very well, it could have a poor predictive ability to identify minority instances. Most of the risk assessment models apply sampling to deal with the class imbalanced problem. However, sampling method may lead to lack of data integrity and the model is sensitive on the sampling result as to produce inaccurate problems. This study constructs a risk model using Synthetic Minority Over-sampling Technique (SMOTE) to tackle class imbalance problems. The model we proposed not only fixed the lack of data integrity, but also solved the poor minority class predictive ability issue, hence improved the overall model accuracy. In the end, the study compares the results of classification with several sampling methods and previous Granular Computing model. By calculation and compare of the accuracy, AUC and G-means, we can conclude that using Synthetic Minority Over-sampling Technique to construct risk models would have the same or even better result than sampling and Granular Computing model. 張永佳 2012 學位論文 ; thesis 40 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 工業工程與管理學系 === 100 === The main source of revenue of financial institutions is the interest they charge from their customers. But not all the customers will pay back their debt, financial institutions need to adopt some kind of risk assessment models in order to measure this credit risk. It is not uncommon to observe class imbalance problem in finance risk data. Class imbalance problem is asymmetric categories within data, that is, there is one class of data (major class) significantly outnumbered others (minor class). If we trained a model with imbalanced data, while the accuracy of major class instances might be very well, it could have a poor predictive ability to identify minority instances. Most of the risk assessment models apply sampling to deal with the class imbalanced problem. However, sampling method may lead to lack of data integrity and the model is sensitive on the sampling result as to produce inaccurate problems. This study constructs a risk model using Synthetic Minority Over-sampling Technique (SMOTE) to tackle class imbalance problems. The model we proposed not only fixed the lack of data integrity, but also solved the poor minority class predictive ability issue, hence improved the overall model accuracy. In the end, the study compares the results of classification with several sampling methods and previous Granular Computing model. By calculation and compare of the accuracy, AUC and G-means, we can conclude that using Synthetic Minority Over-sampling Technique to construct risk models would have the same or even better result than sampling and Granular Computing model.
author2 張永佳
author_facet 張永佳
Yi-Hsien Lin
林宜憲
author Yi-Hsien Lin
林宜憲
spellingShingle Yi-Hsien Lin
林宜憲
Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
author_sort Yi-Hsien Lin
title Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
title_short Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
title_full Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
title_fullStr Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
title_full_unstemmed Constructing a Credit Risk Assessment Model using Synthetic Minority Over-sampling Technique
title_sort constructing a credit risk assessment model using synthetic minority over-sampling technique
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/11786273799598686385
work_keys_str_mv AT yihsienlin constructingacreditriskassessmentmodelusingsyntheticminorityoversamplingtechnique
AT línyíxiàn constructingacreditriskassessmentmodelusingsyntheticminorityoversamplingtechnique
AT yihsienlin yīngyòngzēngshēngshǎoshùhéchéngjìshùjiàngòuxìnyòngfēngxiǎnpínggūmóxíng
AT línyíxiàn yīngyòngzēngshēngshǎoshùhéchéngjìshùjiàngòuxìnyòngfēngxiǎnpínggūmóxíng
_version_ 1718213303026581504