The Importance of the Data Complexity Indices on Classification Methods in Data Mining
碩士 === 淡江大學 === 統計學系碩士班 === 101 === Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, re...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/79384325767465254236 |
id |
ndltd-TW-101TKU05337005 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101TKU053370052015-10-13T22:35:34Z http://ndltd.ncl.edu.tw/handle/79384325767465254236 The Importance of the Data Complexity Indices on Classification Methods in Data Mining 資料複雜度指標在資料探勘分類方法之重要性 Shih Yung Wang 王詩詠 碩士 淡江大學 統計學系碩士班 101 Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, researchers used to compare them on several datasets in terms of classification accuracy or training time, and so on. In practice, however, different classification problems has their unique data complexities which might affect the accuracies of the classifiers. Therefore, we adopt fifteen data complexity indices to quantify the data characteristics and use correct classification rate to observe the influence of these indices on seven commonly used classification techniques. We also use factor analysis to explore the correlation among these indices. The results show that different data characteristics indeed have impacts on classification performance. According to our studies, for classification problems, researchers can calculate the data complexity indices or factor values suggested in this paper to estimate the classification difficulties, and also choose the most appropriate classification method on their study. 陳景祥 2013 學位論文 ; thesis 61 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 淡江大學 === 統計學系碩士班 === 101 === Classification techniques in data mining are often used to deal with a variety of classification problems. Choosing suitable methods for analysis from many classification techniques becomes an important issue. For the performance evaluations of the classifiers, researchers used to compare them on several datasets in terms of classification accuracy or training time, and so on. In practice, however, different classification problems has their unique data complexities which might affect the accuracies of the classifiers. Therefore, we adopt fifteen data complexity indices to quantify the data characteristics and use correct classification rate to observe the influence of these indices on seven commonly used classification techniques. We also use factor analysis to explore the correlation among these indices. The results show that different data characteristics indeed have impacts on classification performance. According to our studies, for classification problems, researchers can calculate the data complexity indices or factor values suggested in this paper to estimate the classification difficulties, and also choose the most appropriate classification method on their study.
|
author2 |
陳景祥 |
author_facet |
陳景祥 Shih Yung Wang 王詩詠 |
author |
Shih Yung Wang 王詩詠 |
spellingShingle |
Shih Yung Wang 王詩詠 The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
author_sort |
Shih Yung Wang |
title |
The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
title_short |
The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
title_full |
The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
title_fullStr |
The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
title_full_unstemmed |
The Importance of the Data Complexity Indices on Classification Methods in Data Mining |
title_sort |
importance of the data complexity indices on classification methods in data mining |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/79384325767465254236 |
work_keys_str_mv |
AT shihyungwang theimportanceofthedatacomplexityindicesonclassificationmethodsindatamining AT wángshīyǒng theimportanceofthedatacomplexityindicesonclassificationmethodsindatamining AT shihyungwang zīliàofùzádùzhǐbiāozàizīliàotànkānfēnlèifāngfǎzhīzhòngyàoxìng AT wángshīyǒng zīliàofùzádùzhǐbiāozàizīliàotànkānfēnlèifāngfǎzhīzhòngyàoxìng AT shihyungwang importanceofthedatacomplexityindicesonclassificationmethodsindatamining AT wángshīyǒng importanceofthedatacomplexityindicesonclassificationmethodsindatamining |
_version_ |
1718078944738017280 |