Decision tree induction with constrained number of leaf node
碩士 === 國立中央大學 === 資訊管理研究所 === 97 === Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/91700158014594986498 |
id |
ndltd-TW-097NCU05396083 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097NCU053960832015-11-16T16:08:55Z http://ndltd.ncl.edu.tw/handle/91700158014594986498 Decision tree induction with constrained number of leaf node 樹葉節點數目限制下的決策樹建構 Xiang-Yu Yang 楊翔宇 碩士 國立中央大學 資訊管理研究所 97 Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and has the high efficiency computing. Decision tree is widely applied to signal classification, expert system, and medical diagnosis. Because of the noise data and special case of training data sets, decision tree is always huge and it contains too many branches and rules which are difficult to understand. This shortcoming reduces the availability of decision tree. Therefore, we reduce rules from a decision tree by limiting the number of leaf nodes of the decision tree and achieve the highest accuracy with the number of leaf nodes given by user. For this purpose, we propose a new algorithm. We use the agglomerative approach of the hierarchical clustering to limit the decision tree to binary tree by combining the branches of decision tree. Experiment results show that compared with the C4.5, the proposed algorithm successfully reduces the number of leaf nodes and makes better accuracy. Yan-liang Chen 陳彥良 2009 學位論文 ; thesis 44 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊管理研究所 === 97 === Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and has the high efficiency computing. Decision tree is widely applied to signal classification, expert system, and medical diagnosis. Because of the noise data and special case of training data sets, decision tree is always huge and it contains too many branches and rules which are difficult to understand. This shortcoming reduces the availability of decision tree.
Therefore, we reduce rules from a decision tree by limiting the number of leaf nodes of the decision tree and achieve the highest accuracy with the number of leaf nodes given by user. For this purpose, we propose a new algorithm. We use the agglomerative approach of the hierarchical clustering to limit the decision tree to binary tree by combining the branches of decision tree.
Experiment results show that compared with the C4.5, the proposed algorithm successfully reduces the number of leaf nodes and makes better accuracy.
|
author2 |
Yan-liang Chen |
author_facet |
Yan-liang Chen Xiang-Yu Yang 楊翔宇 |
author |
Xiang-Yu Yang 楊翔宇 |
spellingShingle |
Xiang-Yu Yang 楊翔宇 Decision tree induction with constrained number of leaf node |
author_sort |
Xiang-Yu Yang |
title |
Decision tree induction with constrained number of leaf node |
title_short |
Decision tree induction with constrained number of leaf node |
title_full |
Decision tree induction with constrained number of leaf node |
title_fullStr |
Decision tree induction with constrained number of leaf node |
title_full_unstemmed |
Decision tree induction with constrained number of leaf node |
title_sort |
decision tree induction with constrained number of leaf node |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/91700158014594986498 |
work_keys_str_mv |
AT xiangyuyang decisiontreeinductionwithconstrainednumberofleafnode AT yángxiángyǔ decisiontreeinductionwithconstrainednumberofleafnode AT xiangyuyang shùyèjiédiǎnshùmùxiànzhìxiàdejuécèshùjiàngòu AT yángxiángyǔ shùyèjiédiǎnshùmùxiànzhìxiàdejuécèshùjiàngòu |
_version_ |
1718130380822806528 |