Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework
Master's === National Taichung University of Science and Technology === Master's Program, Department of Computer Science and Information Engineering === 103 === As data volumes grow, it is hard to analyze large data sets efficiently on a single computer. Hadoop clusters are therefore very important, because they let us store and process large data. Data mining plays an important role in data analysis. Becau...
Main Authors: | Bing-Da Chin, 秦秉達 |
---|---|
Other Authors: | Hung-Ming Chen |
Format: | Others |
Language: | zh-TW |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/fu84aw |
id |
ndltd-TW-103NTTI5392016 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTTI5392016 2019-09-24T03:34:13Z http://ndltd.ncl.edu.tw/handle/fu84aw Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework 基於Hadoop MapReduce叢集設計平行化二元分類演算法 Bing-Da Chin 秦秉達 Master's National Taichung University of Science and Technology Master's Program, Department of Computer Science and Information Engineering 103 As data volumes grow, it is hard to analyze large data sets efficiently on a single computer. Hadoop clusters are therefore very important, because they let us store and process large data. Data mining plays an important role in data analysis. Because the time complexity of the binary-class SVM classification algorithm is a major issue, we design a parallel binary SVM algorithm to solve this problem while still classifying the data appropriately. By leveraging the parallel processing capability of MapReduce, we implement a multi-layer binary SVM in the MapReduce framework and run it successfully on a Hadoop cluster. Experiments with different Hadoop cluster parameter settings on the same training data set show that the new algorithm can reduce the computation time significantly. Hung-Ming Chen Shih-Ying Chen 陳弘明 陳世穎 2015 degree thesis ; thesis 74 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
Master's === National Taichung University of Science and Technology === Master's Program, Department of Computer Science and Information Engineering === 103 === As data volumes grow, it is hard to analyze large data sets efficiently on a single computer. Hadoop clusters are therefore very important, because they let us store and process large data. Data mining plays an important role in data analysis. Because the time complexity of the binary-class SVM classification algorithm is a major issue, we design a parallel binary SVM algorithm to solve this problem while still classifying the data appropriately.
By leveraging the parallel processing capability of MapReduce, we implement a multi-layer binary SVM in the MapReduce framework and run it successfully on a Hadoop cluster. Experiments with different Hadoop cluster parameter settings on the same training data set show that the new algorithm can reduce the computation time significantly.
|
author2 |
Hung-Ming Chen |
author_facet |
Hung-Ming Chen Bing-Da Chin 秦秉達 |
author |
Bing-Da Chin 秦秉達 |
spellingShingle |
Bing-Da Chin 秦秉達 Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
author_sort |
Bing-Da Chin |
title |
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
title_short |
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
title_full |
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
title_fullStr |
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
title_full_unstemmed |
Design of Parallel Binary Classification Algorithm Based on Hadoop Cluster with MapReduce Framework |
title_sort |
design of parallel binary classification algorithm based on hadoop cluster with mapreduce framework |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/fu84aw |
work_keys_str_mv |
AT bingdachin designofparallelbinaryclassificationalgorithmbasedonhadoopclusterwithmapreduceframework AT qínbǐngdá designofparallelbinaryclassificationalgorithmbasedonhadoopclusterwithmapreduceframework AT bingdachin jīyúhadoopmapreducecóngjíshèjìpíngxínghuàèryuánfēnlèiyǎnsuànfǎ AT qínbǐngdá jīyúhadoopmapreducecóngjíshèjìpíngxínghuàèryuánfēnlèiyǎnsuànfǎ |
_version_ |
1719256450485714944 |