A Study of Data Clustering Method Cooperate with Expert Knowledge

碩士 === 東海大學 === 工業工程與經營資訊學系 === 92 === Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another...

Full description

Bibliographic Details
Main Authors: Yao Hui-Lin, 姚輝林
Other Authors: Wang Wei-Hua Andrew
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/59573961926055167594
id ndltd-TW-092THU00030024
record_format oai_dc
spelling ndltd-TW-092THU000300242016-06-15T04:17:30Z http://ndltd.ncl.edu.tw/handle/59573961926055167594 A Study of Data Clustering Method Cooperate with Expert Knowledge 導入專家知識的資料群聚方法之研究 Yao Hui-Lin 姚輝林 碩士 東海大學 工業工程與經營資訊學系 92 Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another, yet, are very dissimilar to objects in other clusters. In the past, lots of the clustering algorithms are proposed. Dissimilarities are assessed based on the attribute values describing the objects. In general, the differences are characterized by similarity measures. However, it used the properties of data to be the basis of analysis in most clustering algorithms. Since data can only provide the relationships between attributes, but not the causalities. The lack of domain knowledge causes to some clustering mistakes. In this paper, we provide the mechanism to cooperate the data clustering algorithm with the expert''s domain knowledge. This research proposed a new similarity measure; the measure can set each attribute with different weight to calculate the similarity. This research adds the expert''s domain knowledge into the data clustering algorithm by means of a new similarity measure. The ZOO dataset and the PET dataset are used to validate the effectiveness of the new algorithm. Wang Wei-Hua Andrew 王偉華 2004 學位論文 ; thesis 57 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 東海大學 === 工業工程與經營資訊學系 === 92 === Clustering has its root in many areas, including data mining, statistics, biology, and machine learning. Clustering is the process of grouping data into classes. The objects within a cluster have high similarity in specific features comparison with each another, yet, are very dissimilar to objects in other clusters. In the past, lots of the clustering algorithms are proposed. Dissimilarities are assessed based on the attribute values describing the objects. In general, the differences are characterized by similarity measures. However, it used the properties of data to be the basis of analysis in most clustering algorithms. Since data can only provide the relationships between attributes, but not the causalities. The lack of domain knowledge causes to some clustering mistakes. In this paper, we provide the mechanism to cooperate the data clustering algorithm with the expert''s domain knowledge. This research proposed a new similarity measure; the measure can set each attribute with different weight to calculate the similarity. This research adds the expert''s domain knowledge into the data clustering algorithm by means of a new similarity measure. The ZOO dataset and the PET dataset are used to validate the effectiveness of the new algorithm.
author2 Wang Wei-Hua Andrew
author_facet Wang Wei-Hua Andrew
Yao Hui-Lin
姚輝林
author Yao Hui-Lin
姚輝林
spellingShingle Yao Hui-Lin
姚輝林
A Study of Data Clustering Method Cooperate with Expert Knowledge
author_sort Yao Hui-Lin
title A Study of Data Clustering Method Cooperate with Expert Knowledge
title_short A Study of Data Clustering Method Cooperate with Expert Knowledge
title_full A Study of Data Clustering Method Cooperate with Expert Knowledge
title_fullStr A Study of Data Clustering Method Cooperate with Expert Knowledge
title_full_unstemmed A Study of Data Clustering Method Cooperate with Expert Knowledge
title_sort study of data clustering method cooperate with expert knowledge
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/59573961926055167594
work_keys_str_mv AT yaohuilin astudyofdataclusteringmethodcooperatewithexpertknowledge
AT yáohuīlín astudyofdataclusteringmethodcooperatewithexpertknowledge
AT yaohuilin dǎorùzhuānjiāzhīshídezīliàoqúnjùfāngfǎzhīyánjiū
AT yáohuīlín dǎorùzhuānjiāzhīshídezīliàoqúnjùfāngfǎzhīyánjiū
AT yaohuilin studyofdataclusteringmethodcooperatewithexpertknowledge
AT yáohuīlín studyofdataclusteringmethodcooperatewithexpertknowledge
_version_ 1718306085868142592