Feature-Based Ensemble Clustering with Indian Buffet Process

Bibliographic Details
Main Authors: WEI, XINYU, 魏新宇
Other Authors: Lee, Chia-Hoang
Format: Others
Language: en_US
Published: 2016
Online Access: http://ndltd.ncl.edu.tw/handle/ezd6uf
id ndltd-TW-104NCTU5394111
record_format oai_dc
spelling ndltd-TW-104NCTU5394111 ; 2019-05-15T23:08:42Z ; http://ndltd.ncl.edu.tw/handle/ezd6uf ; Feature-Based Ensemble Clustering with Indian Buffet Process ; 基於印度餐廳過程的特征抽取整合分群法 ; WEI, XINYU ; 魏新宇 ; Master's thesis ; National Chiao Tung University ; Institute of Computer Science and Engineering ; 104 ; Lee, Chia-Hoang ; Liu, Chien-Liang ; Chuang, Jen-Hui ; 李嘉晃 ; 劉建良 ; 莊仁輝 ; 2016 ; 學位論文 ; thesis ; 44 ; en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Master's thesis === National Chiao Tung University === Institute of Computer Science and Engineering === 104 === With the development of technology, the amount of data grows exponentially. This makes data clustering increasingly important, since clustering is a key technique in data exploration. Clustering is an unsupervised learning method, so improving performance and obtaining robust clustering results are challenging tasks in machine learning. Moreover, specifying the number of clusters is another problem for a certain class of clustering algorithms. Previous studies have shown that ensemble learning, which considers many clustering methods and aggregates their results, can yield a better and more robust result than a single method. This thesis proposes a feature-based ensemble clustering model based on the Indian Buffet Process (IBP). The proposed model does not need to know the number of clusters in advance, and it obtains the most suitable number for the data during the clustering process. The proposed method uses quality and diversity as performance criteria to select feature subsets based on the IBP and a proposed greedy algorithm. Each feature subset is treated as a view of the data, and each subset yields ten clustering results. The final clustering result is obtained by aggregating these results with the proposed aggregation algorithm. The experimental results indicate that the proposed model generally outperforms other unsupervised methods.
author2 Lee, Chia-Hoang
author_facet Lee, Chia-Hoang
WEI, XINYU
魏新宇
author WEI, XINYU
魏新宇
spellingShingle WEI, XINYU
魏新宇
Feature-Based Ensemble Clustering with Indian Buffet Process
author_sort WEI, XINYU
title Feature-Based Ensemble Clustering with Indian Buffet Process
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/ezd6uf