A Study on AdaBoost Algorithm

碩士 === 國立清華大學 === 資訊系統與應用研究所 === 102 === Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For exa...

Full description

Bibliographic Details
Main Authors: Hsieh, Chih-Yu, 謝志優
Other Authors: Chen, Chaur-Chin
Format: Others
Language:en_US
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/18874159480075931972
id ndltd-TW-102NTHU5394009
record_format oai_dc
spelling ndltd-TW-102NTHU53940092015-10-13T23:37:12Z http://ndltd.ncl.edu.tw/handle/18874159480075931972 A Study on AdaBoost Algorithm 探討AdaBoost演算法 Hsieh, Chih-Yu 謝志優 碩士 國立清華大學 資訊系統與應用研究所 102 Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For example, we can discover patterns from the genes of patients by using classification and then it can apply to other patients by using this pattern. Thus, data mining is the most important technology in data analysis. In Big Data, it cannot obtain the information without using data mining. In this thesis, we study the binary and multiclass classification of AdaBoost algorithm. In this algorithm, each sample has a weight value. It uses T weak classifiers to train the training samples. In training weak classifiers, we must change the weight of each incorrectly and correctly classified sample. Finally, the strong classifier is to combine the votes of all individual weak classifiers and then we can use this strong classifier to predict the unknown data. Experiments on colon cancer, breast cancer, 8OX, and Iris data sets are illustrated. Chen, Chaur-Chin 陳朝欽 2014 學位論文 ; thesis 24 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊系統與應用研究所 === 102 === Classification is one of the technology in data mining, we can discover patterns and relationships between parameters in data by classification that we can use these patterns to predict unknown data. In the real life, it is applied in several areas. For example, we can discover patterns from the genes of patients by using classification and then it can apply to other patients by using this pattern. Thus, data mining is the most important technology in data analysis. In Big Data, it cannot obtain the information without using data mining. In this thesis, we study the binary and multiclass classification of AdaBoost algorithm. In this algorithm, each sample has a weight value. It uses T weak classifiers to train the training samples. In training weak classifiers, we must change the weight of each incorrectly and correctly classified sample. Finally, the strong classifier is to combine the votes of all individual weak classifiers and then we can use this strong classifier to predict the unknown data. Experiments on colon cancer, breast cancer, 8OX, and Iris data sets are illustrated.
author2 Chen, Chaur-Chin
author_facet Chen, Chaur-Chin
Hsieh, Chih-Yu
謝志優
author Hsieh, Chih-Yu
謝志優
spellingShingle Hsieh, Chih-Yu
謝志優
A Study on AdaBoost Algorithm
author_sort Hsieh, Chih-Yu
title A Study on AdaBoost Algorithm
title_short A Study on AdaBoost Algorithm
title_full A Study on AdaBoost Algorithm
title_fullStr A Study on AdaBoost Algorithm
title_full_unstemmed A Study on AdaBoost Algorithm
title_sort study on adaboost algorithm
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/18874159480075931972
work_keys_str_mv AT hsiehchihyu astudyonadaboostalgorithm
AT xièzhìyōu astudyonadaboostalgorithm
AT hsiehchihyu tàntǎoadaboostyǎnsuànfǎ
AT xièzhìyōu tàntǎoadaboostyǎnsuànfǎ
AT hsiehchihyu studyonadaboostalgorithm
AT xièzhìyōu studyonadaboostalgorithm
_version_ 1718086675440074752