Feature-Based Ensemble Clustering with Indian Buffet Process
Master's === National Chiao Tung University === Institute of Computer Science and Engineering === 104 === As technology develops, the amount of data grows exponentially. This makes data clustering increasingly important, since clustering is a key technique in data exploration. Clustering is an unsupervised learning method, so improving performan...
Main Authors: | WEI, XINYU 魏新宇 |
---|---|
Other Authors: | Lee, Chia-Hoang |
Format: | Others |
Language: | en_US |
Published: | 2016 |
Online Access: | http://ndltd.ncl.edu.tw/handle/ezd6uf |
id |
ndltd-TW-104NCTU5394111 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NCTU5394111 2019-05-15T23:08:42Z http://ndltd.ncl.edu.tw/handle/ezd6uf Feature-Based Ensemble Clustering with Indian Buffet Process 基於印度餐廳過程的特征抽取整合分群法 WEI, XINYU 魏新宇 Master's National Chiao Tung University Institute of Computer Science and Engineering 104 Lee, Chia-Hoang Liu, Chien-Liang Chuang, Jen-Hui 李嘉晃 劉建良 莊仁輝 2016 thesis 44 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Chiao Tung University === Institute of Computer Science and Engineering === 104 === As technology develops, the amount of data grows exponentially. This makes data clustering increasingly important, since clustering is a key technique in data exploration.
Clustering is an unsupervised learning method, so improving performance and obtaining robust clustering results are challenging tasks in machine learning. Moreover, specifying the number of clusters is another problem for a certain class of clustering algorithms. Previous studies have shown that ensemble learning, which runs many clustering methods and aggregates their results, can yield a better and more robust result than any single method. This thesis proposes a feature-based ensemble clustering model based on the Indian Buffet Process (IBP). The proposed model does not need to know the number of clusters in advance; it obtains the most suitable number for the data during clustering. The proposed method uses quality and diversity as performance criteria to select feature subsets, based on the IBP and a proposed greedy algorithm. Each feature subset is treated as a view of the data, and each subset produces ten clustering results. The final clustering result is obtained by combining these results with the proposed aggregation algorithm. The experimental results indicate that the proposed model generally outperforms other unsupervised methods.
|
author2 |
Lee, Chia-Hoang |
author_facet |
Lee, Chia-Hoang WEI, XINYU 魏新宇 |
author |
WEI, XINYU 魏新宇 |
author_sort |
WEI, XINYU |
title |
Feature-Based Ensemble Clustering with Indian Buffet Process |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/ezd6uf |
_version_ |
1719140532182056960 |
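
The description above names the Indian Buffet Process (IBP) as the basis for selecting feature subsets, but the abstract does not say how the thesis applies it. As a rough illustration only, the following is a minimal sketch of the standard IBP generative process, which samples a binary matrix with an unbounded number of columns. The function names `sample_poisson` and `sample_ibp` and the parameters `num_objects`, `alpha`, and `seed` are assumptions made for this sketch, not taken from the thesis.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw from Poisson(rate) with Knuth's method (adequate for small rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_ibp(num_objects, alpha, seed=0):
    """Sample a binary object-by-feature matrix from the standard IBP.

    Hypothetical helper for illustration; not the thesis's selection algorithm.
    """
    rng = random.Random(seed)
    rows = []         # one list of 0/1 entries per object ("customer")
    dish_counts = []  # dish_counts[k] = how many objects own feature ("dish") k
    for i in range(1, num_objects + 1):
        # Object i takes each existing dish k with probability m_k / i.
        row = [1 if rng.random() < m / i else 0 for m in dish_counts]
        for k, taken in enumerate(row):
            dish_counts[k] += taken
        # It then adds Poisson(alpha / i) brand-new dishes.
        new_dishes = sample_poisson(alpha / i, rng)
        row.extend([1] * new_dishes)
        dish_counts.extend([1] * new_dishes)
        rows.append(row)
    # Pad earlier rows with zeros so the matrix is rectangular.
    width = len(dish_counts)
    return [r + [0] * (width - len(r)) for r in rows]


if __name__ == "__main__":
    for row in sample_ibp(num_objects=6, alpha=2.0, seed=1):
        print(row)
```

The number of columns is not fixed in advance; it grows with `alpha` and the number of rows, which is the property that lets IBP-based models avoid committing to a size up front. How the thesis maps rows and columns to data features and feature subsets, and how it scores quality and diversity, is not stated in this record.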