Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework

碩士 === 國立臺中科技大學 === 資訊工程系碩士班 === 103 === With increased amount data today,it is hard to analyze large data on single computer environment efficiently,the hadoop cluster is very important because we can save and large data by hadoop cluster. Data mining plays an important role of data analysis.Becau...

Full description

Bibliographic Details
Main Authors: Bing-Da Chin, 秦秉達
Other Authors: Hung-Ming Chen
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/fu84aw
id ndltd-TW-103NTTI5392016
record_format oai_dc
spelling ndltd-TW-103NTTI53920162019-09-24T03:34:13Z http://ndltd.ncl.edu.tw/handle/fu84aw Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework 基於Hadoop MapReduce叢集設計平行化二元分類演算法 Bing-Da Chin 秦秉達 碩士 國立臺中科技大學 資訊工程系碩士班 103 With increased amount data today,it is hard to analyze large data on single computer environment efficiently,the hadoop cluster is very important because we can save and large data by hadoop cluster. Data mining plays an important role of data analysis.Because time complexity of the binary-class classification SVM algorithm is a big issue,we design a parallel binary SVM algorithm to slove this problem,and achieve the effect of classifying appropriate data. By leveraging the parallel processing property in MapReduce ,we implement multi-layer binary SVM by MapReduce framework,and run on the hadoop cluster successfully. By designing different parameters of hadoop cluster and using the same data set for training analysis, it shows that the new algorithm can reduce the computation time significantly. Hung-Ming Chen Shih-Ying Chen 陳弘明 陳世穎 2015 學位論文 ; thesis 74 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺中科技大學 === 資訊工程系碩士班 === 103 === With increased amount data today,it is hard to analyze large data on single computer environment efficiently,the hadoop cluster is very important because we can save and large data by hadoop cluster. Data mining plays an important role of data analysis.Because time complexity of the binary-class classification SVM algorithm is a big issue,we design a parallel binary SVM algorithm to slove this problem,and achieve the effect of classifying appropriate data. By leveraging the parallel processing property in MapReduce ,we implement multi-layer binary SVM by MapReduce framework,and run on the hadoop cluster successfully. By designing different parameters of hadoop cluster and using the same data set for training analysis, it shows that the new algorithm can reduce the computation time significantly.
author2 Hung-Ming Chen
author_facet Hung-Ming Chen
Bing-Da Chin
秦秉達
author Bing-Da Chin
秦秉達
spellingShingle Bing-Da Chin
秦秉達
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
author_sort Bing-Da Chin
title Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
title_short Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
title_full Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
title_fullStr Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
title_full_unstemmed Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
title_sort design of parallel binary classification algorithm based on hadoop cluster with mapreduce framework
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/fu84aw
work_keys_str_mv AT bingdachin designofparallelbinaryclassificationalgorithmbasedonhadoopclusterwithmapreduceframework
AT qínbǐngdá designofparallelbinaryclassificationalgorithmbasedonhadoopclusterwithmapreduceframework
AT bingdachin jīyúhadoopmapreducecóngjíshèjìpíngxínghuàèryuánfēnlèiyǎnsuànfǎ
AT qínbǐngdá jīyúhadoopmapreducecóngjíshèjìpíngxínghuàèryuánfēnlèiyǎnsuànfǎ
_version_ 1719256450485714944