A Study of Data Clustering Method Cooperate with Expert Knowledge
碩士 === 東海大學 === 工業工程與經營資訊學系 === 92 === Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2004
|
Online Access: | http://ndltd.ncl.edu.tw/handle/59573961926055167594 |
id |
ndltd-TW-092THU00030024 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092THU000300242016-06-15T04:17:30Z http://ndltd.ncl.edu.tw/handle/59573961926055167594 A Study of Data Clustering Method Cooperate with Expert Knowledge 導入專家知識的資料群聚方法之研究 Yao Hui-Lin 姚輝林 碩士 東海大學 工業工程與經營資訊學系 92 Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another, yet, are very dissimilar to objects in other clusters. In the past, lots of the clustering algorithms are proposed. Dissimilarities are assessed based on the attribute values describing the objects. In general, the differences are characterized by similarity measures. However, it used the properties of data to be the basis of analysis in most clustering algorithms. Since data can only provide the relationships between attributes, but not the causalities. The lack of domain knowledge causes to some clustering mistakes. In this paper, we provide the mechanism to cooperate the data clustering algorithm with the expert''s domain knowledge. This research proposed a new similarity measure; the measure can set each attribute with different weight to calculate the similarity. This research adds the expert''s domain knowledge into the data clustering algorithm by means of a new similarity measure. The ZOO dataset and the PET dataset are used to validate the effectiveness of the new algorithm. Wang Wei-Hua Andrew 王偉華 2004 學位論文 ; thesis 57 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 東海大學 === 工業工程與經營資訊學系 === 92 === Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another, yet, are very dissimilar to objects in other clusters. In the past, lots of the clustering algorithms are proposed. Dissimilarities are assessed based on the attribute values describing the objects. In general, the differences are characterized by similarity measures. However, it used the properties of data to be the basis of analysis in most clustering algorithms. Since data can only provide the relationships between attributes, but not the causalities. The lack of domain knowledge causes to some clustering mistakes. In this paper, we provide the mechanism to cooperate the data clustering algorithm with the expert''s domain knowledge.
This research proposed a new similarity measure; the measure can set each attribute with different weight to calculate the similarity. This research adds the expert''s domain knowledge into the data clustering algorithm by means of a new similarity measure.
The ZOO dataset and the PET dataset are used to validate the effectiveness of the new algorithm.
|
author2 |
Wang Wei-Hua Andrew |
author_facet |
Wang Wei-Hua Andrew Yao Hui-Lin 姚輝林 |
author |
Yao Hui-Lin 姚輝林 |
spellingShingle |
Yao Hui-Lin 姚輝林 A Study of Data Clustering Method Cooperate with Expert Knowledge |
author_sort |
Yao Hui-Lin |
title |
A Study of Data Clustering Method Cooperate with Expert Knowledge |
title_short |
A Study of Data Clustering Method Cooperate with Expert Knowledge |
title_full |
A Study of Data Clustering Method Cooperate with Expert Knowledge |
title_fullStr |
A Study of Data Clustering Method Cooperate with Expert Knowledge |
title_full_unstemmed |
A Study of Data Clustering Method Cooperate with Expert Knowledge |
title_sort |
study of data clustering method cooperate with expert knowledge |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/59573961926055167594 |
work_keys_str_mv |
AT yaohuilin astudyofdataclusteringmethodcooperatewithexpertknowledge AT yáohuīlín astudyofdataclusteringmethodcooperatewithexpertknowledge AT yaohuilin dǎorùzhuānjiāzhīshídezīliàoqúnjùfāngfǎzhīyánjiū AT yáohuīlín dǎorùzhuānjiāzhīshídezīliàoqúnjùfāngfǎzhīyánjiū AT yaohuilin studyofdataclusteringmethodcooperatewithexpertknowledge AT yáohuīlín studyofdataclusteringmethodcooperatewithexpertknowledge |
_version_ |
1718306085868142592 |