Decision tree induction with constrained number of leaf node

碩士 === 國立中央大學 === 資訊管理研究所 === 97 === Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and...

Full description

Bibliographic Details
Main Authors: Xiang-Yu Yang, 楊翔宇
Other Authors: Yan-liang Chen
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/91700158014594986498
id ndltd-TW-097NCU05396083
record_format oai_dc
spelling ndltd-TW-097NCU053960832015-11-16T16:08:55Z http://ndltd.ncl.edu.tw/handle/91700158014594986498 Decision tree induction with constrained number of leaf node 樹葉節點數目限制下的決策樹建構 Xiang-Yu Yang 楊翔宇 碩士 國立中央大學 資訊管理研究所 97 Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and has the high efficiency computing. Decision tree is widely applied to signal classification, expert system, and medical diagnosis. Because of the noise data and special case of training data sets, decision tree is always huge and it contains too many branches and rules which are difficult to understand. This shortcoming reduces the availability of decision tree. Therefore, we reduce rules from a decision tree by limiting the number of leaf nodes of the decision tree and achieve the highest accuracy with the number of leaf nodes given by user. For this purpose, we propose a new algorithm. We use the agglomerative approach of the hierarchical clustering to limit the decision tree to binary tree by combining the branches of decision tree. Experiment results show that compared with the C4.5, the proposed algorithm successfully reduces the number of leaf nodes and makes better accuracy. Yan-liang Chen 陳彥良 2009 學位論文 ; thesis 44 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中央大學 === 資訊管理研究所 === 97 === Classification, which builds a data classification model based on attribute value and label of existing data, is a very widespread data mining technology. Decision tree is one of the most popular classification technologies, because it is easy to understand and has the high efficiency computing. Decision tree is widely applied to signal classification, expert system, and medical diagnosis. Because of the noise data and special case of training data sets, decision tree is always huge and it contains too many branches and rules which are difficult to understand. This shortcoming reduces the availability of decision tree. Therefore, we reduce rules from a decision tree by limiting the number of leaf nodes of the decision tree and achieve the highest accuracy with the number of leaf nodes given by user. For this purpose, we propose a new algorithm. We use the agglomerative approach of the hierarchical clustering to limit the decision tree to binary tree by combining the branches of decision tree. Experiment results show that compared with the C4.5, the proposed algorithm successfully reduces the number of leaf nodes and makes better accuracy.
author2 Yan-liang Chen
author_facet Yan-liang Chen
Xiang-Yu Yang
楊翔宇
author Xiang-Yu Yang
楊翔宇
spellingShingle Xiang-Yu Yang
楊翔宇
Decision tree induction with constrained number of leaf node
author_sort Xiang-Yu Yang
title Decision tree induction with constrained number of leaf node
title_short Decision tree induction with constrained number of leaf node
title_full Decision tree induction with constrained number of leaf node
title_fullStr Decision tree induction with constrained number of leaf node
title_full_unstemmed Decision tree induction with constrained number of leaf node
title_sort decision tree induction with constrained number of leaf node
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/91700158014594986498
work_keys_str_mv AT xiangyuyang decisiontreeinductionwithconstrainednumberofleafnode
AT yángxiángyǔ decisiontreeinductionwithconstrainednumberofleafnode
AT xiangyuyang shùyèjiédiǎnshùmùxiànzhìxiàdejuécèshùjiàngòu
AT yángxiángyǔ shùyèjiédiǎnshùmùxiànzhìxiàdejuécèshùjiàngòu
_version_ 1718130380822806528