Concept Separation by Manifold Learning Approach
碩士 === 國立臺灣科技大學 === 資訊工程系 === 95 === Abstract We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2007
|
Online Access: | http://ndltd.ncl.edu.tw/handle/g7nxtw |
id |
ndltd-TW-095NTUS5392076 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NTUS53920762019-05-15T19:47:45Z http://ndltd.ncl.edu.tw/handle/g7nxtw Concept Separation by Manifold Learning Approach ConceptSeparationbyManifoldLearningApproach Chih-Sung Lai 賴志松 碩士 國立臺灣科技大學 資訊工程系 95 Abstract We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report that there is merely one keyword submitted to the search engines by 77% users approximately, and queries submitted by 85% users contain less than three keywords. We call such queries as incomplete queries. Our goal is to separate the main concept corresponding to a query into sub-concept where one of them may be related to the real interest of the user. The documents usually lie on a very high dimensional space where the existence of a keyword means a dimension to be considered. Therefore, we propose a dimension reduction method based on a manifold learning approach for the clustering process. We adopt Isomap for dimension reduction. Experimentally, after the Isomap process, the dimension reduced dataset gives us better presentation of the dataset. Also, due to the data size is reduced, the execution time of the whole mining process can be close to real time. Several variants of Isomap will also be studied in this thesis. Hsing-Kuo Pao 鮑興國 2007 學位論文 ; thesis 71 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 資訊工程系 === 95 === Abstract
We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report that there is merely one keyword submitted to the search engines by 77% users approximately, and queries submitted by 85% users contain less than three keywords. We call such queries as incomplete queries. Our goal is to separate the main concept corresponding to a query into sub-concept where one of them may be related to the real interest of the user. The documents usually lie on a very high dimensional space where the existence of a keyword means a dimension to be considered. Therefore, we propose a dimension reduction method based on a manifold learning approach for the clustering process. We adopt Isomap for dimension reduction. Experimentally, after the Isomap process, the dimension reduced dataset gives us better presentation of the dataset. Also, due to the data size is reduced, the execution time of the whole mining process can be close to real time. Several variants of Isomap will also be studied in this thesis.
|
author2 |
Hsing-Kuo Pao |
author_facet |
Hsing-Kuo Pao Chih-Sung Lai 賴志松 |
author |
Chih-Sung Lai 賴志松 |
spellingShingle |
Chih-Sung Lai 賴志松 Concept Separation by Manifold Learning Approach |
author_sort |
Chih-Sung Lai |
title |
Concept Separation by Manifold Learning Approach |
title_short |
Concept Separation by Manifold Learning Approach |
title_full |
Concept Separation by Manifold Learning Approach |
title_fullStr |
Concept Separation by Manifold Learning Approach |
title_full_unstemmed |
Concept Separation by Manifold Learning Approach |
title_sort |
concept separation by manifold learning approach |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/g7nxtw |
work_keys_str_mv |
AT chihsunglai conceptseparationbymanifoldlearningapproach AT làizhìsōng conceptseparationbymanifoldlearningapproach |
_version_ |
1719093931318181888 |