Comparison of Imbalanced Data Classification Methods

碩士 === 國立臺北大學 === 統計學系 === 105 === Many practical data have imbalanced features such as fraud detection, medical diagnosis, facial recognition, etc. The common classification methods perform poorly under imbalanced data. Sampling techniques are proposed to improve the performance of the classificati...

Full description

Bibliographic Details
Main Authors: CHEN,YI-JEN, 陳逸真
Other Authors: HWANG,YI-TING
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/ee5d82
id ndltd-TW-105NTPU0337016
record_format oai_dc
spelling ndltd-TW-105NTPU03370162019-05-15T23:31:53Z http://ndltd.ncl.edu.tw/handle/ee5d82 Comparison of Imbalanced Data Classification Methods 不平衡資料分類方法之比較 CHEN,YI-JEN 陳逸真 碩士 國立臺北大學 統計學系 105 Many practical data have imbalanced features such as fraud detection, medical diagnosis, facial recognition, etc. The common classification methods perform poorly under imbalanced data. Sampling techniques are proposed to improve the performance of the classification. Nevertheless, the overall performance of these sampling techniques implemented to the classification techniques are not evaluated thoroughly. This thesis uses the Monte Carlo simulation to discuss the performance of the classification and various sampling schemes under various imbalanced settings. Many indices are used to assess the performance. Some guidance is given based on the results of simulations. HWANG,YI-TING 黃怡婷 2017 學位論文 ; thesis 68 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北大學 === 統計學系 === 105 === Many practical data have imbalanced features such as fraud detection, medical diagnosis, facial recognition, etc. The common classification methods perform poorly under imbalanced data. Sampling techniques are proposed to improve the performance of the classification. Nevertheless, the overall performance of these sampling techniques implemented to the classification techniques are not evaluated thoroughly. This thesis uses the Monte Carlo simulation to discuss the performance of the classification and various sampling schemes under various imbalanced settings. Many indices are used to assess the performance. Some guidance is given based on the results of simulations.
author2 HWANG,YI-TING
author_facet HWANG,YI-TING
CHEN,YI-JEN
陳逸真
author CHEN,YI-JEN
陳逸真
spellingShingle CHEN,YI-JEN
陳逸真
Comparison of Imbalanced Data Classification Methods
author_sort CHEN,YI-JEN
title Comparison of Imbalanced Data Classification Methods
title_short Comparison of Imbalanced Data Classification Methods
title_full Comparison of Imbalanced Data Classification Methods
title_fullStr Comparison of Imbalanced Data Classification Methods
title_full_unstemmed Comparison of Imbalanced Data Classification Methods
title_sort comparison of imbalanced data classification methods
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/ee5d82
work_keys_str_mv AT chenyijen comparisonofimbalanceddataclassificationmethods
AT chényìzhēn comparisonofimbalanceddataclassificationmethods
AT chenyijen bùpínghéngzīliàofēnlèifāngfǎzhībǐjiào
AT chényìzhēn bùpínghéngzīliàofēnlèifāngfǎzhībǐjiào
_version_ 1719149027624222720