Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate

碩士 === 國立中正大學 === 資訊管理所 === 95 === Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distributi...

Full description

Bibliographic Details
Main Authors: Li Wei, 劉力瑋
Other Authors: Fan Wu
Format: Others
Language:en_US
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/78752271295978172163
id ndltd-TW-095CCU05396007
record_format oai_dc
spelling ndltd-TW-095CCU053960072015-10-13T10:45:19Z http://ndltd.ncl.edu.tw/handle/78752271295978172163 Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate 不對稱分佈混合增加少數法及多專家分類法之研究-以攝護腺肥大及攝護腺癌偵測為例 Li Wei 劉力瑋 碩士 國立中正大學 資訊管理所 95 Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distribution. In clinical case, because the number of healthy people is more than the number of unhealthy people, the collected data would be congenital skewed distribution. If we utilize those dataset with skewed distribution to construct the prediction model, the prediction deviation should be a big problem. There are three existing solutions for skewed distribution – Under-sampling, Over-sampling, and Multi-classifier Committee Approach. This research will utilize Over-sampling and Multi-classifier Committee Approach for skewed distribution and improve them. The research objective is to raise the prediction accuracy of the minor part of the dataset. The case study is the disease of benign prostate hypertrophy and cancer of prostate. And this research will use those data to test the classification efficiency of my algorithm. Fan Wu 吳帆 2006 學位論文 ; thesis 40 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立中正大學 === 資訊管理所 === 95 === Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distribution. In clinical case, because the number of healthy people is more than the number of unhealthy people, the collected data would be congenital skewed distribution. If we utilize those dataset with skewed distribution to construct the prediction model, the prediction deviation should be a big problem. There are three existing solutions for skewed distribution – Under-sampling, Over-sampling, and Multi-classifier Committee Approach. This research will utilize Over-sampling and Multi-classifier Committee Approach for skewed distribution and improve them. The research objective is to raise the prediction accuracy of the minor part of the dataset. The case study is the disease of benign prostate hypertrophy and cancer of prostate. And this research will use those data to test the classification efficiency of my algorithm.
author2 Fan Wu
author_facet Fan Wu
Li Wei
劉力瑋
author Li Wei
劉力瑋
spellingShingle Li Wei
劉力瑋
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
author_sort Li Wei
title Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
title_short Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
title_full Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
title_fullStr Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
title_full_unstemmed Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
title_sort using over-sampling and multi-classifier committee approach for skewed class distribution – a case study of diagnosis model construction of benign prostate hypertrophy and cancer of prostate
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/78752271295978172163
work_keys_str_mv AT liwei usingoversamplingandmulticlassifiercommitteeapproachforskewedclassdistributionacasestudyofdiagnosismodelconstructionofbenignprostatehypertrophyandcancerofprostate
AT liúlìwěi usingoversamplingandmulticlassifiercommitteeapproachforskewedclassdistributionacasestudyofdiagnosismodelconstructionofbenignprostatehypertrophyandcancerofprostate
AT liwei bùduìchēngfēnbùhùnhézēngjiāshǎoshùfǎjíduōzhuānjiāfēnlèifǎzhīyánjiūyǐshèhùxiànféidàjíshèhùxiànáizhēncèwèilì
AT liúlìwěi bùduìchēngfēnbùhùnhézēngjiāshǎoshùfǎjíduōzhuānjiāfēnlèifǎzhīyánjiūyǐshèhùxiànféidàjíshèhùxiànáizhēncèwèilì
_version_ 1716833273928220672