Decision Tree Construction for Data Mining on Grid Computing

碩士 === 東海大學 === 資訊工程與科學系碩士在職專班 === 92 === Decision tree is one of the frequently used methods in data mining for searching prediction information. Due to its characteristics which are suitable for parallelism, it has been widely adopted in high performance field and developed into various...

Full description

Bibliographic Details
Main Authors: Shu-Tzu Tsai, 蔡淑姿
Other Authors: Chao-Tung Yang
Format: Others
Language:en_US
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/24542453771540632142
id ndltd-TW-092THU00392003
record_format oai_dc
spelling ndltd-TW-092THU003920032016-06-15T04:17:50Z http://ndltd.ncl.edu.tw/handle/24542453771540632142 Decision Tree Construction for Data Mining on Grid Computing 在格網計算上用於資料探勘之決策樹建構 Shu-Tzu Tsai 蔡淑姿 碩士 東海大學 資訊工程與科學系碩士在職專班 92 Decision tree is one of the frequently used methods in data mining for searching prediction information. Due to its characteristics which are suitable for parallelism, it has been widely adopted in high performance field and developed into various parallel decision tree algorithms to deal with huge data and complex computation. Following the development of other technology fields, Grid computing is regarded as the extension of PC Cluster and therefore it future research development is highly valued. This new wave of internet application is the 3rd generation of internet applications following the traditional internet and Web application. In this thesis, we have presented the Grid-based decision tree architecture, and hope it can be applied on both parallel and sequential algorithms for the decision tree applications. Also, based on the scope and model of data mining applied in the Grid environment as well as user equivalent perspective, Grid roles can be categorized into three types. We are hoping that through these definitions, software developers can define clear system processes and differentiate the application scope for software applications. To fulfill our architecture, we first apply an existing parallel decision tree algorithm-SPRINT algorithm in the Grid environment. The performance and differences in many other areas are compared using different sizes of dataset. The experimental results will be used for future reference and further development. Chao-Tung Yang 楊朝棟 2004 學位論文 ; thesis 57 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 東海大學 === 資訊工程與科學系碩士在職專班 === 92 === Decision tree is one of the frequently used methods in data mining for searching prediction information. Due to its characteristics which are suitable for parallelism, it has been widely adopted in high performance field and developed into various parallel decision tree algorithms to deal with huge data and complex computation. Following the development of other technology fields, Grid computing is regarded as the extension of PC Cluster and therefore it future research development is highly valued. This new wave of internet application is the 3rd generation of internet applications following the traditional internet and Web application. In this thesis, we have presented the Grid-based decision tree architecture, and hope it can be applied on both parallel and sequential algorithms for the decision tree applications. Also, based on the scope and model of data mining applied in the Grid environment as well as user equivalent perspective, Grid roles can be categorized into three types. We are hoping that through these definitions, software developers can define clear system processes and differentiate the application scope for software applications. To fulfill our architecture, we first apply an existing parallel decision tree algorithm-SPRINT algorithm in the Grid environment. The performance and differences in many other areas are compared using different sizes of dataset. The experimental results will be used for future reference and further development.
author2 Chao-Tung Yang
author_facet Chao-Tung Yang
Shu-Tzu Tsai
蔡淑姿
author Shu-Tzu Tsai
蔡淑姿
spellingShingle Shu-Tzu Tsai
蔡淑姿
Decision Tree Construction for Data Mining on Grid Computing
author_sort Shu-Tzu Tsai
title Decision Tree Construction for Data Mining on Grid Computing
title_short Decision Tree Construction for Data Mining on Grid Computing
title_full Decision Tree Construction for Data Mining on Grid Computing
title_fullStr Decision Tree Construction for Data Mining on Grid Computing
title_full_unstemmed Decision Tree Construction for Data Mining on Grid Computing
title_sort decision tree construction for data mining on grid computing
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/24542453771540632142
work_keys_str_mv AT shutzutsai decisiontreeconstructionfordataminingongridcomputing
AT càishūzī decisiontreeconstructionfordataminingongridcomputing
AT shutzutsai zàigéwǎngjìsuànshàngyòngyúzīliàotànkānzhījuécèshùjiàngòu
AT càishūzī zàigéwǎngjìsuànshàngyòngyúzīliàotànkānzhījuécèshùjiàngòu
_version_ 1718306262318317568