A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms
碩士 === 南台科技大學 === 資訊管理系 === 93 === This thesis studies hierarchical conceptual clustering for feature selection and search strategy. Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and control strateg...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/13626630496115554892 |
id |
ndltd-TW-093STUT0396035 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093STUT03960352016-11-22T04:12:13Z http://ndltd.ncl.edu.tw/handle/13626630496115554892 A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms 適用於COBWEB演算法之屬性挑選與預先排序策略研究 Jong Yu Chow 周仲愚 碩士 南台科技大學 資訊管理系 93 This thesis studies hierarchical conceptual clustering for feature selection and search strategy. Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and control strategy used to search the space of clustering. Ideally, the search strategy should consistently construct clusterings of high quality, but be computationally inexpensive as well. Hierarchical conceptual clustering by a system known as COBWEB. COBWEB, a conceptual clustering system that store knowledge in conceptual hierarchical , and uses an heuristic evaluation function called category utility that measures the quality for a set of probabilistic categories. In this thesis, we propose two strategies for COBWEB algorithm. Gini index is similar in form to the category utility, and based on standard squared-difference metric. It is used for feature selection and search strategy. More, Gini index and category utility are predicted class labels as probability. In particular, this thesis investigates the presorting strategy, it is more efficiency to reducing ordering effects on input data, furthermore, improve the clustering quality. Continue, being a computationally information gain to removes importantless feature subset, therefore, make lower time complexity in particular clustering. Ideally, these two strategies should consistently construct hierarchical conceptual clustering of high quality as well. Guang Yeh Tung 童冠燁 2005 學位論文 ; thesis 63 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 南台科技大學 === 資訊管理系 === 93 === This thesis studies hierarchical conceptual clustering for feature selection and search strategy.
Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and control strategy used to search the space of clustering. Ideally, the search strategy should consistently construct clusterings of high quality, but be computationally inexpensive as well.
Hierarchical conceptual clustering by a system known as COBWEB. COBWEB, a conceptual clustering system that store knowledge in conceptual hierarchical , and uses an heuristic evaluation function called category utility that measures the quality for a set of probabilistic categories.
In this thesis, we propose two strategies for COBWEB algorithm. Gini index is similar in form to the category utility, and based on standard squared-difference metric. It is used for feature selection and search strategy. More, Gini index and category utility are predicted class labels as probability. In particular, this thesis investigates the presorting strategy, it is more efficiency to reducing ordering effects on input data, furthermore, improve the clustering quality. Continue, being a computationally information gain to removes importantless feature subset, therefore, make lower time complexity in particular clustering. Ideally, these two strategies should consistently construct hierarchical conceptual clustering of high quality as well.
|
author2 |
Guang Yeh Tung |
author_facet |
Guang Yeh Tung Jong Yu Chow 周仲愚 |
author |
Jong Yu Chow 周仲愚 |
spellingShingle |
Jong Yu Chow 周仲愚 A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
author_sort |
Jong Yu Chow |
title |
A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
title_short |
A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
title_full |
A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
title_fullStr |
A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
title_full_unstemmed |
A Study of Feature Selection and Pre-Sorting Strategy for COBWEB Algorithms |
title_sort |
study of feature selection and pre-sorting strategy for cobweb algorithms |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/13626630496115554892 |
work_keys_str_mv |
AT jongyuchow astudyoffeatureselectionandpresortingstrategyforcobwebalgorithms AT zhōuzhòngyú astudyoffeatureselectionandpresortingstrategyforcobwebalgorithms AT jongyuchow shìyòngyúcobwebyǎnsuànfǎzhīshǔxìngtiāoxuǎnyǔyùxiānpáixùcèlüèyánjiū AT zhōuzhòngyú shìyòngyúcobwebyǎnsuànfǎzhīshǔxìngtiāoxuǎnyǔyùxiānpáixùcèlüèyánjiū AT jongyuchow studyoffeatureselectionandpresortingstrategyforcobwebalgorithms AT zhōuzhòngyú studyoffeatureselectionandpresortingstrategyforcobwebalgorithms |
_version_ |
1718396254952620032 |