A Two-staged Clustering Algorithm with Multiple Scales

碩士 === 元智大學 === 資訊管理研究所 === 91 === Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believ...

Full description

Bibliographic Details
Main Authors: Rung-Ting Chien, 簡榮廷
Other Authors: Chien-Lung Chan
Format: Others
Language:zh-TW
Published: 2003
Online Access:http://ndltd.ncl.edu.tw/handle/15988641902381400969
id ndltd-TW-091YZU00396029
record_format oai_dc
spelling ndltd-TW-091YZU003960292015-10-13T13:39:20Z http://ndltd.ncl.edu.tw/handle/15988641902381400969 A Two-staged Clustering Algorithm with Multiple Scales 多尺度資料的二階段分群演算法 Rung-Ting Chien 簡榮廷 碩士 元智大學 資訊管理研究所 91 Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believe traditional clustering algorithm can be incorporated with expert''s subjective judgment. And different scales -- Nominal, Ordinal, Interval and Ratio, should have different methods to calculate the degree of similarity. So we try to combine expert''s weight and multi-scale into clustering process. Our purpose is to solve the problems that clustering result is hard to explain and result can''t meet the decision marker''s need. In this paper, we propose a two-staged clustering algorithm to solve these problems. In the first-staged, we use the training data to find some parameters that can improve our cluster quality. And we cluster all data and these parameters in the second-staged. In our algorithm, we use multi-scales and unequal weight to calculate all kinds of data and use four standard data sets (Wisconsin Breast Cancer Data, Contraceptive Method Choice Data, Iris Education Data and Balance Scale Weight & Distance Data) to test our algorithm. In the end we find better quality of clustering results in using multi-scale and better prediction with expert''s weight, we find two conclusions in our experiments. First, clustering use multiple scale calculation can improve the quality of similarity within group and dissimilarity between groups. Second, clustering with expert''s weight has better prediction than clustering with equal weight. So we believe multi-scales with expert''s weight clustering algorithm can not only improve clustering quality but also meets decision marker''s requirement Chien-Lung Chan 詹前隆 2003 學位論文 ; thesis 85 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊管理研究所 === 91 === Cluster analysis is a kind of data mining techniques, and its goal is to find the hidden patterns from the data. In related studies, most of the reseachera use equal weight to cluster data and only use metric calculation to deal with four kinds of scales .We believe traditional clustering algorithm can be incorporated with expert''s subjective judgment. And different scales -- Nominal, Ordinal, Interval and Ratio, should have different methods to calculate the degree of similarity. So we try to combine expert''s weight and multi-scale into clustering process. Our purpose is to solve the problems that clustering result is hard to explain and result can''t meet the decision marker''s need. In this paper, we propose a two-staged clustering algorithm to solve these problems. In the first-staged, we use the training data to find some parameters that can improve our cluster quality. And we cluster all data and these parameters in the second-staged. In our algorithm, we use multi-scales and unequal weight to calculate all kinds of data and use four standard data sets (Wisconsin Breast Cancer Data, Contraceptive Method Choice Data, Iris Education Data and Balance Scale Weight & Distance Data) to test our algorithm. In the end we find better quality of clustering results in using multi-scale and better prediction with expert''s weight, we find two conclusions in our experiments. First, clustering use multiple scale calculation can improve the quality of similarity within group and dissimilarity between groups. Second, clustering with expert''s weight has better prediction than clustering with equal weight. So we believe multi-scales with expert''s weight clustering algorithm can not only improve clustering quality but also meets decision marker''s requirement
author2 Chien-Lung Chan
author_facet Chien-Lung Chan
Rung-Ting Chien
簡榮廷
author Rung-Ting Chien
簡榮廷
spellingShingle Rung-Ting Chien
簡榮廷
A Two-staged Clustering Algorithm with Multiple Scales
author_sort Rung-Ting Chien
title A Two-staged Clustering Algorithm with Multiple Scales
title_short A Two-staged Clustering Algorithm with Multiple Scales
title_full A Two-staged Clustering Algorithm with Multiple Scales
title_fullStr A Two-staged Clustering Algorithm with Multiple Scales
title_full_unstemmed A Two-staged Clustering Algorithm with Multiple Scales
title_sort two-staged clustering algorithm with multiple scales
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/15988641902381400969
work_keys_str_mv AT rungtingchien atwostagedclusteringalgorithmwithmultiplescales
AT jiǎnróngtíng atwostagedclusteringalgorithmwithmultiplescales
AT rungtingchien duōchǐdùzīliàodeèrjiēduànfēnqúnyǎnsuànfǎ
AT jiǎnróngtíng duōchǐdùzīliàodeèrjiēduànfēnqúnyǎnsuànfǎ
AT rungtingchien twostagedclusteringalgorithmwithmultiplescales
AT jiǎnróngtíng twostagedclusteringalgorithmwithmultiplescales
_version_ 1717739448696832000