A Two-staged Clustering Algorithm with Multiple Scales

碩士 === 元智大學 === 資訊管理研究所 === 91 === Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believ...

Full description

Bibliographic Details
Main Authors:	Rung-Ting Chien, 簡榮廷
Other Authors:	Chien-Lung Chan
Format:	Others
Language:	zh-TW
Published:	2003
Online Access:	http://ndltd.ncl.edu.tw/handle/15988641902381400969

id	ndltd-TW-091YZU00396029
record_format	oai_dc
spelling	ndltd-TW-091YZU003960292015-10-13T13:39:20Z http://ndltd.ncl.edu.tw/handle/15988641902381400969 A Two-staged Clustering Algorithm with Multiple Scales 多尺度資料的二階段分群演算法 Rung-Ting Chien 簡榮廷碩士元智大學資訊管理研究所 91 Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believe traditional clustering algorithm can be incorporated with expert''s subjective judgment. And different scales -- Nominal, Ordinal, Interval and Ratio, should have different methods to calculate the degree of similarity. So we try to combine expert''s weight and multi-scale into clustering process. Our purpose is to solve the problems that clustering result is hard to explain and result can''t meet the decision marker''s need. In this paper, we propose a two-staged clustering algorithm to solve these problems. In the first-staged, we use the training data to find some parameters that can improve our cluster quality. And we cluster all data and these parameters in the second-staged. In our algorithm, we use multi-scales and unequal weight to calculate all kinds of data and use four standard data sets (Wisconsin Breast Cancer Data, Contraceptive Method Choice Data, Iris Education Data and Balance Scale Weight & Distance Data) to test our algorithm. In the end we find better quality of clustering results in using multi-scale and better prediction with expert''s weight, we find two conclusions in our experiments. First, clustering use multiple scale calculation can improve the quality of similarity within group and dissimilarity between groups. Second, clustering with expert''s weight has better prediction than clustering with equal weight. So we believe multi-scales with expert''s weight clustering algorithm can not only improve clustering quality but also meets decision marker''s requirement Chien-Lung Chan 詹前隆 2003 學位論文 ; thesis 85 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 元智大學 === 資訊管理研究所 === 91 === Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believe traditional clustering algorithm can be incorporated with expert''s subjective judgment. And different scales -- Nominal, Ordinal, Interval and Ratio, should have different methods to calculate the degree of similarity. So we try to combine expert''s weight and multi-scale into clustering process. Our purpose is to solve the problems that clustering result is hard to explain and result can''t meet the decision marker''s need. In this paper, we propose a two-staged clustering algorithm to solve these problems. In the first-staged, we use the training data to find some parameters that can improve our cluster quality. And we cluster all data and these parameters in the second-staged. In our algorithm, we use multi-scales and unequal weight to calculate all kinds of data and use four standard data sets (Wisconsin Breast Cancer Data, Contraceptive Method Choice Data, Iris Education Data and Balance Scale Weight & Distance Data) to test our algorithm. In the end we find better quality of clustering results in using multi-scale and better prediction with expert''s weight, we find two conclusions in our experiments. First, clustering use multiple scale calculation can improve the quality of similarity within group and dissimilarity between groups. Second, clustering with expert''s weight has better prediction than clustering with equal weight. So we believe multi-scales with expert''s weight clustering algorithm can not only improve clustering quality but also meets decision marker''s requirement
author2	Chien-Lung Chan
author_facet	Chien-Lung Chan Rung-Ting Chien 簡榮廷
author	Rung-Ting Chien 簡榮廷
spellingShingle	Rung-Ting Chien 簡榮廷 A Two-staged Clustering Algorithm with Multiple Scales
author_sort	Rung-Ting Chien
title	A Two-staged Clustering Algorithm with Multiple Scales
title_short	A Two-staged Clustering Algorithm with Multiple Scales
title_full	A Two-staged Clustering Algorithm with Multiple Scales
title_fullStr	A Two-staged Clustering Algorithm with Multiple Scales
title_full_unstemmed	A Two-staged Clustering Algorithm with Multiple Scales
title_sort	two-staged clustering algorithm with multiple scales
publishDate	2003
url	http://ndltd.ncl.edu.tw/handle/15988641902381400969
work_keys_str_mv	AT rungtingchien atwostagedclusteringalgorithmwithmultiplescales AT jiǎnróngtíng atwostagedclusteringalgorithmwithmultiplescales AT rungtingchien duōchǐdùzīliàodeèrjiēduànfēnqúnyǎnsuànfǎ AT jiǎnróngtíng duōchǐdùzīliàodeèrjiēduànfēnqúnyǎnsuànfǎ AT rungtingchien twostagedclusteringalgorithmwithmultiplescales AT jiǎnróngtíng twostagedclusteringalgorithmwithmultiplescales
_version_	1717739448696832000

A Two-staged Clustering Algorithm with Multiple Scales

Similar Items