The Importance of the Data Complexity Indices on Classification Methods in Data Mining

碩士 === 淡江大學 === 統計學系碩士班 === 101 === Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, re...

Full description

Bibliographic Details
Main Authors: Shih Yung Wang, 王詩詠
Other Authors: 陳景祥
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/79384325767465254236
id ndltd-TW-101TKU05337005
record_format oai_dc
spelling ndltd-TW-101TKU053370052015-10-13T22:35:34Z http://ndltd.ncl.edu.tw/handle/79384325767465254236 The Importance of the Data Complexity Indices on Classification Methods in Data Mining 資料複雜度指標在資料探勘分類方法之重要性 Shih Yung Wang 王詩詠 碩士 淡江大學 統計學系碩士班 101 Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, researchers used to compare them on several datasets in terms of classification accuracy or training time, and so on. In practice, however, different classification problems has their unique data complexities which might affect the accuracies of the classifiers. Therefore, we adopt fifteen data complexity indices to quantify the data characteristics and use correct classification rate to observe the influence of these indices on seven commonly used classification techniques. We also use factor analysis to explore the correlation among these indices. The results show that different data characteristics indeed have impacts on classification performance. According to our studies, for classification problems, researchers can calculate the data complexity indices or factor values suggested in this paper to estimate the classification difficulties, and also choose the most appropriate classification method on their study. 陳景祥 2013 學位論文 ; thesis 61 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 淡江大學 === 統計學系碩士班 === 101 === Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, researchers used to compare them on several datasets in terms of classification accuracy or training time, and so on. In practice, however, different classification problems has their unique data complexities which might affect the accuracies of the classifiers. Therefore, we adopt fifteen data complexity indices to quantify the data characteristics and use correct classification rate to observe the influence of these indices on seven commonly used classification techniques. We also use factor analysis to explore the correlation among these indices. The results show that different data characteristics indeed have impacts on classification performance. According to our studies, for classification problems, researchers can calculate the data complexity indices or factor values suggested in this paper to estimate the classification difficulties, and also choose the most appropriate classification method on their study.
author2 陳景祥
author_facet 陳景祥
Shih Yung Wang
王詩詠
author Shih Yung Wang
王詩詠
spellingShingle Shih Yung Wang
王詩詠
The Importance of the Data Complexity Indices on Classification Methods in Data Mining
author_sort Shih Yung Wang
title The Importance of the Data Complexity Indices on Classification Methods in Data Mining
title_short The Importance of the Data Complexity Indices on Classification Methods in Data Mining
title_full The Importance of the Data Complexity Indices on Classification Methods in Data Mining
title_fullStr The Importance of the Data Complexity Indices on Classification Methods in Data Mining
title_full_unstemmed The Importance of the Data Complexity Indices on Classification Methods in Data Mining
title_sort importance of the data complexity indices on classification methods in data mining
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/79384325767465254236
work_keys_str_mv AT shihyungwang theimportanceofthedatacomplexityindicesonclassificationmethodsindatamining
AT wángshīyǒng theimportanceofthedatacomplexityindicesonclassificationmethodsindatamining
AT shihyungwang zīliàofùzádùzhǐbiāozàizīliàotànkānfēnlèifāngfǎzhīzhòngyàoxìng
AT wángshīyǒng zīliàofùzádùzhǐbiāozàizīliàotànkānfēnlèifāngfǎzhīzhòngyàoxìng
AT shihyungwang importanceofthedatacomplexityindicesonclassificationmethodsindatamining
AT wángshīyǒng importanceofthedatacomplexityindicesonclassificationmethodsindatamining
_version_ 1718078944738017280