A Hybrid Classification Method Based on Binary Partition of Instances

碩士 === 國立成功大學 === 資訊管理研究所 === 105 === Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A c...

Full description

Bibliographic Details
Main Authors: Guo-HongChen, 陳國鴻
Other Authors: Tzu-Tsung Wong
Format: Others
Language:zh-TW
Published: 2017
Online Access:http://ndltd.ncl.edu.tw/handle/r3v68z
id ndltd-TW-105NCKU5396010
record_format oai_dc
spelling ndltd-TW-105NCKU53960102019-05-15T23:47:01Z http://ndltd.ncl.edu.tw/handle/r3v68z A Hybrid Classification Method Based on Binary Partition of Instances 以資料二元分割方式為基礎的混合分類方法 Guo-HongChen 陳國鴻 碩士 國立成功大學 資訊管理研究所 105 Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A classification algorithm is trained by the data that have been processed by another, and this is called hybrid classification. This study presents a hybrid classification algorithm that first divides a training set into two subsets by a classification algorithm. Then a model is learned from not only each of the two subsets, but also from the whole training set by another algorithm. Every test instance will be classified by one of the three models. The proposed hybrid classification algorithm is tested on 20 data sets for analyzing its prediction accuracy and computational efficiency. The experimental results show that our hybrid algorithm significantly outperforms naïve Bayesian classifier and decision tree learning in most data sets, while it needs more time to learn models. With respect to two hybrid classification algorithms proposed by other studies, our hybrid algorithm can have not only a significantly higher accuracy, but also a relatively lower computational cost. Tzu-Tsung Wong 翁慈宗 2017 學位論文 ; thesis 51 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 資訊管理研究所 === 105 === Classification is an essential task in data mining. Preprocess techniques are generally used to improve data quality for enhancing the performance of class prediction. The techniques for data preprocessing can be categorized as on attributes or on instances. A classification algorithm is trained by the data that have been processed by another, and this is called hybrid classification. This study presents a hybrid classification algorithm that first divides a training set into two subsets by a classification algorithm. Then a model is learned from not only each of the two subsets, but also from the whole training set by another algorithm. Every test instance will be classified by one of the three models. The proposed hybrid classification algorithm is tested on 20 data sets for analyzing its prediction accuracy and computational efficiency. The experimental results show that our hybrid algorithm significantly outperforms naïve Bayesian classifier and decision tree learning in most data sets, while it needs more time to learn models. With respect to two hybrid classification algorithms proposed by other studies, our hybrid algorithm can have not only a significantly higher accuracy, but also a relatively lower computational cost.
author2 Tzu-Tsung Wong
author_facet Tzu-Tsung Wong
Guo-HongChen
陳國鴻
author Guo-HongChen
陳國鴻
spellingShingle Guo-HongChen
陳國鴻
A Hybrid Classification Method Based on Binary Partition of Instances
author_sort Guo-HongChen
title A Hybrid Classification Method Based on Binary Partition of Instances
title_short A Hybrid Classification Method Based on Binary Partition of Instances
title_full A Hybrid Classification Method Based on Binary Partition of Instances
title_fullStr A Hybrid Classification Method Based on Binary Partition of Instances
title_full_unstemmed A Hybrid Classification Method Based on Binary Partition of Instances
title_sort hybrid classification method based on binary partition of instances
publishDate 2017
url http://ndltd.ncl.edu.tw/handle/r3v68z
work_keys_str_mv AT guohongchen ahybridclassificationmethodbasedonbinarypartitionofinstances
AT chénguóhóng ahybridclassificationmethodbasedonbinarypartitionofinstances
AT guohongchen yǐzīliàoèryuánfēngēfāngshìwèijīchǔdehùnhéfēnlèifāngfǎ
AT chénguóhóng yǐzīliàoèryuánfēngēfāngshìwèijīchǔdehùnhéfēnlèifāngfǎ
AT guohongchen hybridclassificationmethodbasedonbinarypartitionofinstances
AT chénguóhóng hybridclassificationmethodbasedonbinarypartitionofinstances
_version_ 1719154727829110784