A Study on AdaBoost Algorithm
碩士 === 國立清華大學 === 資訊系統與應用研究所 === 102 === Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For exa...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/18874159480075931972 |
id |
ndltd-TW-102NTHU5394009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NTHU53940092015-10-13T23:37:12Z http://ndltd.ncl.edu.tw/handle/18874159480075931972 A Study on AdaBoost Algorithm 探討AdaBoost演算法 Hsieh, Chih-Yu 謝志優 碩士 國立清華大學 資訊系統與應用研究所 102 Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For example, we can discover patterns from the genes of patients by using classification and then it can apply to other patients by using this pattern. Thus, data mining is the most important technology in data analysis. In Big Data, it cannot obtain the information without using data mining. In this thesis, we study the binary and multiclass classification of AdaBoost algorithm. In this algorithm, each sample has a weight value. It uses T weak classifiers to train the training samples. In training weak classifiers, we must change the weight of each incorrectly and correctly classified sample. Finally, the strong classifier is to combine the votes of all individual weak classifiers and then we can use this strong classifier to predict the unknown data. Experiments on colon cancer, breast cancer, 8OX, and Iris data sets are illustrated. Chen, Chaur-Chin 陳朝欽 2014 學位論文 ; thesis 24 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊系統與應用研究所 === 102 === Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For example, we can discover patterns from the genes of patients by using classification and then it can apply to other patients by using this pattern. Thus, data mining is the most important technology in data analysis. In Big Data, it cannot obtain the information without using data mining.
In this thesis, we study the binary and multiclass classification of AdaBoost algorithm. In this algorithm, each sample has a weight value. It uses T weak classifiers to train the training samples. In training weak classifiers, we must change the weight of each incorrectly and correctly classified sample. Finally, the strong classifier is to combine the votes of all individual weak classifiers and then we can use this strong classifier to predict the unknown data. Experiments on colon cancer, breast cancer, 8OX, and Iris data sets are illustrated.
|
author2 |
Chen, Chaur-Chin |
author_facet |
Chen, Chaur-Chin Hsieh, Chih-Yu 謝志優 |
author |
Hsieh, Chih-Yu 謝志優 |
spellingShingle |
Hsieh, Chih-Yu 謝志優 A Study on AdaBoost Algorithm |
author_sort |
Hsieh, Chih-Yu |
title |
A Study on AdaBoost Algorithm |
title_short |
A Study on AdaBoost Algorithm |
title_full |
A Study on AdaBoost Algorithm |
title_fullStr |
A Study on AdaBoost Algorithm |
title_full_unstemmed |
A Study on AdaBoost Algorithm |
title_sort |
study on adaboost algorithm |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/18874159480075931972 |
work_keys_str_mv |
AT hsiehchihyu astudyonadaboostalgorithm AT xièzhìyōu astudyonadaboostalgorithm AT hsiehchihyu tàntǎoadaboostyǎnsuànfǎ AT xièzhìyōu tàntǎoadaboostyǎnsuànfǎ AT hsiehchihyu studyonadaboostalgorithm AT xièzhìyōu studyonadaboostalgorithm |
_version_ |
1718086675440074752 |