A Weighting Agglomerative All-means Clustering Algorithmand Its Applications

博士 === 元智大學 === 工業工程與管理學系 === 98 === Given a dataset consisting of numerous objects, clustering aims at partitioning the dataset into several homogeneous clusters based on the similarities/dissimilarities among objects. Partitional clustering methods are the most popular and widespread clustering me...

Full description

Bibliographic Details
Main Authors:	Chuang-Cheng Chiu, 邱創政
Other Authors:	Chieh-Yuan Tsai
Format:	Others
Language:	en_US
Published:	2010
Online Access:	http://ndltd.ncl.edu.tw/handle/03573328242434194250

id	ndltd-TW-098YZU05031086
record_format	oai_dc
spelling	ndltd-TW-098YZU050310862015-10-13T18:20:56Z http://ndltd.ncl.edu.tw/handle/03573328242434194250 A Weighting Agglomerative All-means Clustering Algorithmand Its Applications 一個屬性加權之自我凝聚分群法及其應用 Chuang-Cheng Chiu 邱創政博士元智大學工業工程與管理學系 98 Given a dataset consisting of numerous objects, clustering aims at partitioning the dataset into several homogeneous clusters based on the similarities/dissimilarities among objects. Partitional clustering methods are the most popular and widespread clustering methods due to their computational efficiency and easy manipulation. However, users still encounter several unavoidable challenges when employing partitional clustering methods, including specifying a proper dissimilarity measure, setting laden parameters, and obtaining the result sensitive to initialization of cluster centers. In this dissertation, a novel partitional clustering method entitled Weighting Agglomerative All-means clustering algorithm (WAA-means) is proposed to simultaneously fulfill these requisitions. The elementary idea of WAA-means is derived from how to move object locations for effective clustering based on the concept of gravitational force. Each object in a dataset is regarded as an independent individual. For any two objects, there is an attraction existing between them and the attraction activates them to be close to each other. Anytime each object should be affected by different attractions from other objects simultaneously, so that it accordingly moves to a new location determined based on the combination of these attractions. The new location of an object will be deeply influenced by the location of another object similar to this object because the attraction between them is large. As time goes by, the objects similar to each other will progressively move to approach together and eventually agglomerate at a single location. Through the WAA-means algorithm, consequently, all objects initially distributed around are placed at a few agglomerated locations finally. At the moment, each object will not move any more because it not only has the maximal attractions with the objects at the same agglomerated location but also has extremely weak attractions with the objects at different agglomerated locations. Therefore, the objects that will be agglomerated at an identical location can be classified to the same cluster. The number of clusters also can be determined by counting the number of final agglomerated locations, rather than being assigned before performing the WAA-means algorithm. The details about the WAA-means algorithm are elucidated, including defining the attraction between two objects, developing a mechanism to measure the influence weights of attractions that act on an object, expounding the whole computational procedures of WAA-means, and analyzing the convergence and robustness of WAA-means. Furthermore, with the advancement of information technology, the fast-growing, tremendous amount of data has been collected in many application domains over past several decades. For maintaining the efficiency of the proposed WAA-means algorithm when handling such a large dataset, a data summarization procedure is presented in order to reduce the data size of inputs of the WAA-means algorithm. Finally, we will apply the WAA-means algorithm to two practical applications, including case-based reasoning and gene microarray biclustering, for improving their performances. Chieh-Yuan Tsai 蔡介元 2010 學位論文 ; thesis 136 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 元智大學 === 工業工程與管理學系 === 98 === Given a dataset consisting of numerous objects, clustering aims at partitioning the dataset into several homogeneous clusters based on the similarities/dissimilarities among objects. Partitional clustering methods are the most popular and widespread clustering methods due to their computational efficiency and easy manipulation. However, users still encounter several unavoidable challenges when employing partitional clustering methods, including specifying a proper dissimilarity measure, setting laden parameters, and obtaining the result sensitive to initialization of cluster centers. In this dissertation, a novel partitional clustering method entitled Weighting Agglomerative All-means clustering algorithm (WAA-means) is proposed to simultaneously fulfill these requisitions. The elementary idea of WAA-means is derived from how to move object locations for effective clustering based on the concept of gravitational force. Each object in a dataset is regarded as an independent individual. For any two objects, there is an attraction existing between them and the attraction activates them to be close to each other. Anytime each object should be affected by different attractions from other objects simultaneously, so that it accordingly moves to a new location determined based on the combination of these attractions. The new location of an object will be deeply influenced by the location of another object similar to this object because the attraction between them is large. As time goes by, the objects similar to each other will progressively move to approach together and eventually agglomerate at a single location. Through the WAA-means algorithm, consequently, all objects initially distributed around are placed at a few agglomerated locations finally. At the moment, each object will not move any more because it not only has the maximal attractions with the objects at the same agglomerated location but also has extremely weak attractions with the objects at different agglomerated locations. Therefore, the objects that will be agglomerated at an identical location can be classified to the same cluster. The number of clusters also can be determined by counting the number of final agglomerated locations, rather than being assigned before performing the WAA-means algorithm. The details about the WAA-means algorithm are elucidated, including defining the attraction between two objects, developing a mechanism to measure the influence weights of attractions that act on an object, expounding the whole computational procedures of WAA-means, and analyzing the convergence and robustness of WAA-means. Furthermore, with the advancement of information technology, the fast-growing, tremendous amount of data has been collected in many application domains over past several decades. For maintaining the efficiency of the proposed WAA-means algorithm when handling such a large dataset, a data summarization procedure is presented in order to reduce the data size of inputs of the WAA-means algorithm. Finally, we will apply the WAA-means algorithm to two practical applications, including case-based reasoning and gene microarray biclustering, for improving their performances.
author2	Chieh-Yuan Tsai
author_facet	Chieh-Yuan Tsai Chuang-Cheng Chiu 邱創政
author	Chuang-Cheng Chiu 邱創政
spellingShingle	Chuang-Cheng Chiu 邱創政 A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
author_sort	Chuang-Cheng Chiu
title	A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
title_short	A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
title_full	A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
title_fullStr	A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
title_full_unstemmed	A Weighting Agglomerative All-means Clustering Algorithmand Its Applications
title_sort	weighting agglomerative all-means clustering algorithmand its applications
publishDate	2010
url	http://ndltd.ncl.edu.tw/handle/03573328242434194250
work_keys_str_mv	AT chuangchengchiu aweightingagglomerativeallmeansclusteringalgorithmanditsapplications AT qiūchuàngzhèng aweightingagglomerativeallmeansclusteringalgorithmanditsapplications AT chuangchengchiu yīgèshǔxìngjiāquánzhīzìwǒníngjùfēnqúnfǎjíqíyīngyòng AT qiūchuàngzhèng yīgèshǔxìngjiāquánzhīzìwǒníngjùfēnqúnfǎjíqíyīngyòng AT chuangchengchiu weightingagglomerativeallmeansclusteringalgorithmanditsapplications AT qiūchuàngzhèng weightingagglomerativeallmeansclusteringalgorithmanditsapplications
_version_	1718030088248754176

A Weighting Agglomerative All-means Clustering Algorithmand Its Applications

Similar Items