Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate
碩士 === 國立中正大學 === 資訊管理所 === 95 === Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distributi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/78752271295978172163 |
id |
ndltd-TW-095CCU05396007 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095CCU053960072015-10-13T10:45:19Z http://ndltd.ncl.edu.tw/handle/78752271295978172163 Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate 不對稱分佈混合增加少數法及多專家分類法之研究-以攝護腺肥大及攝護腺癌偵測為例 Li Wei 劉力瑋 碩士 國立中正大學 資訊管理所 95 Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distribution. In clinical case, because the number of healthy people is more than the number of unhealthy people, the collected data would be congenital skewed distribution. If we utilize those dataset with skewed distribution to construct the prediction model, the prediction deviation should be a big problem. There are three existing solutions for skewed distribution – Under-sampling, Over-sampling, and Multi-classifier Committee Approach. This research will utilize Over-sampling and Multi-classifier Committee Approach for skewed distribution and improve them. The research objective is to raise the prediction accuracy of the minor part of the dataset. The case study is the disease of benign prostate hypertrophy and cancer of prostate. And this research will use those data to test the classification efficiency of my algorithm. Fan Wu 吳帆 2006 學位論文 ; thesis 40 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中正大學 === 資訊管理所 === 95 === Regarding the non-skewed distribution, to utilize the existing data mining classification to construct the prediction model can reach a certain level of prediction accuracy. However, in the real data mining case, the dataset distribution is always skewed distribution. In clinical case, because the number of healthy people is more than the number of unhealthy people, the collected data would be congenital skewed distribution. If we utilize those dataset with skewed distribution to construct the prediction model, the prediction deviation should be a big problem.
There are three existing solutions for skewed distribution – Under-sampling, Over-sampling, and Multi-classifier Committee Approach. This research will utilize Over-sampling and Multi-classifier Committee Approach for skewed distribution and improve them. The research objective is to raise the prediction accuracy of the minor part of the dataset. The case study is the disease of benign prostate hypertrophy and cancer of prostate. And this research will use those data to test the classification efficiency of my algorithm.
|
author2 |
Fan Wu |
author_facet |
Fan Wu Li Wei 劉力瑋 |
author |
Li Wei 劉力瑋 |
spellingShingle |
Li Wei 劉力瑋 Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
author_sort |
Li Wei |
title |
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
title_short |
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
title_full |
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
title_fullStr |
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
title_full_unstemmed |
Using Over-sampling and Multi-classifier Committee Approach for skewed class distribution – a case study of diagnosis model construction of Benign prostate hypertrophy and Cancer of prostate |
title_sort |
using over-sampling and multi-classifier committee approach for skewed class distribution – a case study of diagnosis model construction of benign prostate hypertrophy and cancer of prostate |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/78752271295978172163 |
work_keys_str_mv |
AT liwei usingoversamplingandmulticlassifiercommitteeapproachforskewedclassdistributionacasestudyofdiagnosismodelconstructionofbenignprostatehypertrophyandcancerofprostate AT liúlìwěi usingoversamplingandmulticlassifiercommitteeapproachforskewedclassdistributionacasestudyofdiagnosismodelconstructionofbenignprostatehypertrophyandcancerofprostate AT liwei bùduìchēngfēnbùhùnhézēngjiāshǎoshùfǎjíduōzhuānjiāfēnlèifǎzhīyánjiūyǐshèhùxiànféidàjíshèhùxiànáizhēncèwèilì AT liúlìwěi bùduìchēngfēnbùhùnhézēngjiāshǎoshùfǎjíduōzhuānjiāfēnlèifǎzhīyánjiūyǐshèhùxiànféidàjíshèhùxiànáizhēncèwèilì |
_version_ |
1716833273928220672 |