A Hybrid Classification Method Based on Binary Partition of Instances
碩士 === 國立成功大學 === 資訊管理研究所 === 105 === Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A c...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2017
|
Online Access: | http://ndltd.ncl.edu.tw/handle/r3v68z |
id |
ndltd-TW-105NCKU5396010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-105NCKU53960102019-05-15T23:47:01Z http://ndltd.ncl.edu.tw/handle/r3v68z A Hybrid Classification Method Based on Binary Partition of Instances 以資料二元分割方式為基礎的混合分類方法 Guo-HongChen 陳國鴻 碩士 國立成功大學 資訊管理研究所 105 Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A classification algorithm is trained by the data that have been processed by another, and this is called hybrid classification. This study presents a hybrid classification algorithm that first divides a training set into two subsets by a classification algorithm. Then a model is learned from not only each of the two subsets, but also from the whole training set by another algorithm. Every test instance will be classified by one of the three models. The proposed hybrid classification algorithm is tested on 20 data sets for analyzing its prediction accuracy and computational efficiency. The experimental results show that our hybrid algorithm significantly outperforms naïve Bayesian classifier and decision tree learning in most data sets, while it needs more time to learn models. With respect to two hybrid classification algorithms proposed by other studies, our hybrid algorithm can have not only a significantly higher accuracy, but also a relatively lower computational cost. Tzu-Tsung Wong 翁慈宗 2017 學位論文 ; thesis 51 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊管理研究所 === 105 === Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A classification algorithm is trained by the data that have been processed by another, and this is called hybrid classification. This study presents a hybrid classification algorithm that first divides a training set into two subsets by a classification algorithm. Then a model is learned from not only each of the two subsets, but also from the whole training set by another algorithm. Every test instance will be classified by one of the three models. The proposed hybrid classification algorithm is tested on 20 data sets for analyzing its prediction accuracy and computational efficiency. The experimental results show that our hybrid algorithm significantly outperforms naïve Bayesian classifier and decision tree learning in most data sets, while it needs more time to learn models. With respect to two hybrid classification algorithms proposed by other studies, our hybrid algorithm can have not only a significantly higher accuracy, but also a relatively lower computational cost.
|
author2 |
Tzu-Tsung Wong |
author_facet |
Tzu-Tsung Wong Guo-HongChen 陳國鴻 |
author |
Guo-HongChen 陳國鴻 |
spellingShingle |
Guo-HongChen 陳國鴻 A Hybrid Classification Method Based on Binary Partition of Instances |
author_sort |
Guo-HongChen |
title |
A Hybrid Classification Method Based on Binary Partition of Instances |
title_short |
A Hybrid Classification Method Based on Binary Partition of Instances |
title_full |
A Hybrid Classification Method Based on Binary Partition of Instances |
title_fullStr |
A Hybrid Classification Method Based on Binary Partition of Instances |
title_full_unstemmed |
A Hybrid Classification Method Based on Binary Partition of Instances |
title_sort |
hybrid classification method based on binary partition of instances |
publishDate |
2017 |
url |
http://ndltd.ncl.edu.tw/handle/r3v68z |
work_keys_str_mv |
AT guohongchen ahybridclassificationmethodbasedonbinarypartitionofinstances AT chénguóhóng ahybridclassificationmethodbasedonbinarypartitionofinstances AT guohongchen yǐzīliàoèryuánfēngēfāngshìwèijīchǔdehùnhéfēnlèifāngfǎ AT chénguóhóng yǐzīliàoèryuánfēngēfāngshìwèijīchǔdehùnhéfēnlèifāngfǎ AT guohongchen hybridclassificationmethodbasedonbinarypartitionofinstances AT chénguóhóng hybridclassificationmethodbasedonbinarypartitionofinstances |
_version_ |
1719154727829110784 |