Construction of High-Efficiency Decision Tree with Principal Component Analysis



Bibliographic Details
Main Author: Yi-Ya Chang (張怡雅)
Other Authors: Hung-Yi Lin
Format: Others
Language: zh-TW
Published: 2010
Online Access: http://ndltd.ncl.edu.tw/handle/rbs4c4
id ndltd-TW-098NTTI5691011
record_format oai_dc
spelling ndltd-TW-098NTTI5691011 2019-09-24T03:34:02Z http://ndltd.ncl.edu.tw/handle/rbs4c4 Construction of High-Efficiency Decision Tree with Principal Component Analysis 以主成份分析建構高效率決策樹 Yi-Ya Chang 張怡雅 Master's thesis, Taichung Institute of Technology (臺中技術學院), Master's Program, Department of Distribution Management (流通管理系碩士班), academic year 98 Hung-Yi Lin 林泓毅 2010 學位論文 ; thesis 57 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's thesis, Taichung Institute of Technology (臺中技術學院), Master's Program, Department of Distribution Management (流通管理系碩士班), academic year 98 (ROC calendar). With advances in technology and the growing popularity of the Internet, the amount of information worldwide has grown geometrically. In response, large database management systems have been deployed across many application fields, and extracting important knowledge from them relies on efficient data mining techniques. Among these techniques, the decision tree is an important tool that can identify existing causal relationships. A traditional decision tree classifies data using univariate attributes, and the resulting classification model is usually large. Because it neglects the correlation between feature attributes, it may apply similar classification rules repeatedly, which lowers the efficiency of inductive learning. To improve classification efficiency, this study proposes a strategy that applies principal component analysis (PCA) to simplify the classification. The communality and explained variance obtained from PCA are used to decide an appropriate set of feature attributes, from which a multivariate classifier is produced. This multivariate hybrid attribute is then used as the root of the constructed decision tree. Finally, the method is evaluated on the UCI database and compared with traditional C4.5: the proposed method uses multivariate hybrid attributes, whereas C4.5 uses univariate attributes.
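The record gives only the abstract, so the following is a minimal sketch of the idea it describes, not the author's implementation. It assumes PCA on the correlation matrix, a hypothetical communality cutoff of 0.5 for choosing attributes, the first principal component restricted to the chosen attributes as the "multivariate hybrid attribute", and plain information gain for the root threshold (C4.5 itself uses gain ratio). All function names and the toy data are illustrative.

```python
import math

def standardize(rows):
    """Z-score each column (population std); assumes no constant column."""
    n = len(rows)
    cols = list(zip(*rows))
    means = [sum(c) / n for c in cols]
    stds = [math.sqrt(sum((x - m) ** 2 for x in c) / n) for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(r, means, stds)] for r in rows]

def correlation(z):
    """Correlation matrix of already standardized rows."""
    n, d = len(z), len(z[0])
    return [[sum(r[i] * r[j] for r in z) / n for j in range(d)] for i in range(d)]

def top_components(mat, k, iters=500):
    """Top-k (eigenvalue, eigenvector) pairs by power iteration with deflation.
    Assumes the uniform start vector is not orthogonal to the leading eigenvector."""
    d = len(mat)
    mat = [row[:] for row in mat]
    pairs = []
    for _ in range(k):
        v = [1.0 / math.sqrt(d)] * d
        for _ in range(iters):
            w = [sum(mat[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        lam = sum(v[i] * mat[i][j] * v[j] for i in range(d) for j in range(d))
        pairs.append((lam, v))
        for i in range(d):
            for j in range(d):
                mat[i][j] -= lam * v[i] * v[j]  # remove the found component
    return pairs

def communalities(pairs, d):
    """Per-attribute sum of squared loadings (loading = eigvec * sqrt(eigval))."""
    return [sum(lam * v[j] ** 2 for lam, v in pairs) for j in range(d)]

def entropy(labels):
    n = len(labels)
    return -sum((labels.count(c) / n) * math.log2(labels.count(c) / n)
                for c in set(labels))

def best_split(scores, labels):
    """Threshold on one numeric attribute that maximizes information gain."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    base, best = entropy(labels), (0.0, None)
    for a, b in zip(order, order[1:]):
        if scores[a] == scores[b]:
            continue
        t = (scores[a] + scores[b]) / 2
        left = [labels[i] for i in order if scores[i] <= t]
        right = [labels[i] for i in order if scores[i] > t]
        gain = base - (len(left) * entropy(left)
                       + len(right) * entropy(right)) / len(labels)
        if gain > best[0]:
            best = (gain, t)
    return best

def hybrid_root(rows, labels, k=1, min_communality=0.5):
    """Pick attributes by communality, project onto PC1, split at the root."""
    z = standardize(rows)
    pairs = top_components(correlation(z), k)
    comm = communalities(pairs, len(rows[0]))
    selected = [j for j, c in enumerate(comm) if c >= min_communality]
    v1 = pairs[0][1]
    scores = [sum(v1[j] * r[j] for j in selected) for r in z]
    gain, threshold = best_split(scores, labels)
    return selected, scores, gain, threshold

# Toy data: columns 0 and 1 are strongly correlated, column 2 is unrelated.
rows = [[1, 1.1, 1], [2, 1.9, -1], [3, 3.2, -1], [4, 3.8, 1],
        [5, 5.1, 1], [6, 6.2, -1], [7, 6.8, -1], [8, 8.1, 1]]
labels = ["a"] * 4 + ["b"] * 4
selected, scores, gain, threshold = hybrid_root(rows, labels)
print("selected attributes:", selected, "root gain:", gain)
```

In this sketch a single test on the combined score stands in for what a univariate tree might need separate tests on each correlated attribute to express, which is the efficiency argument the abstract makes.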
author2 Hung-Yi Lin
author_facet Hung-Yi Lin
Yi-Ya Chang
張怡雅
author Yi-Ya Chang
張怡雅
spellingShingle Yi-Ya Chang
張怡雅
Construction of High-Efficiency Decision Tree with Principal Component Analysis
author_sort Yi-Ya Chang
title Construction of High-Efficiency Decision Tree with Principal Component Analysis
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/rbs4c4