Concept Separation by Manifold Learning Approach

碩士 === 國立臺灣科技大學 === 資訊工程系 === 95 === Abstract We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report...

Full description

Bibliographic Details
Main Authors: Chih-Sung Lai, 賴志松
Other Authors: Hsing-Kuo Pao
Format: Others
Language:en_US
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/g7nxtw
id ndltd-TW-095NTUS5392076
record_format oai_dc
spelling ndltd-TW-095NTUS53920762019-05-15T19:47:45Z http://ndltd.ncl.edu.tw/handle/g7nxtw Concept Separation by Manifold Learning Approach ConceptSeparationbyManifoldLearningApproach Chih-Sung Lai 賴志松 碩士 國立臺灣科技大學 資訊工程系 95 Abstract We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report that there is merely one keyword submitted to the search engines by 77% users approximately, and queries submitted by 85% users contain less than three keywords. We call such queries as incomplete queries. Our goal is to separate the main concept corresponding to a query into sub-concept where one of them may be related to the real interest of the user. The documents usually lie on a very high dimensional space where the existence of a keyword means a dimension to be considered. Therefore, we propose a dimension reduction method based on a manifold learning approach for the clustering process. We adopt Isomap for dimension reduction. Experimentally, after the Isomap process, the dimension reduced dataset gives us better presentation of the dataset. Also, due to the data size is reduced, the execution time of the whole mining process can be close to real time. Several variants of Isomap will also be studied in this thesis. Hsing-Kuo Pao 鮑興國 2007 學位論文 ; thesis 71 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 95 === Abstract We usually rely on effective search engines to maximize the usage of rich web information. However, in many situations, the query input is in limited size and even the powerful search engine can not catch the actual goal from the users. Statistics report that there is merely one keyword submitted to the search engines by 77% users approximately, and queries submitted by 85% users contain less than three keywords. We call such queries as incomplete queries. Our goal is to separate the main concept corresponding to a query into sub-concept where one of them may be related to the real interest of the user. The documents usually lie on a very high dimensional space where the existence of a keyword means a dimension to be considered. Therefore, we propose a dimension reduction method based on a manifold learning approach for the clustering process. We adopt Isomap for dimension reduction. Experimentally, after the Isomap process, the dimension reduced dataset gives us better presentation of the dataset. Also, due to the data size is reduced, the execution time of the whole mining process can be close to real time. Several variants of Isomap will also be studied in this thesis.
author2 Hsing-Kuo Pao
author_facet Hsing-Kuo Pao
Chih-Sung Lai
賴志松
author Chih-Sung Lai
賴志松
spellingShingle Chih-Sung Lai
賴志松
Concept Separation by Manifold Learning Approach
author_sort Chih-Sung Lai
title Concept Separation by Manifold Learning Approach
title_short Concept Separation by Manifold Learning Approach
title_full Concept Separation by Manifold Learning Approach
title_fullStr Concept Separation by Manifold Learning Approach
title_full_unstemmed Concept Separation by Manifold Learning Approach
title_sort concept separation by manifold learning approach
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/g7nxtw
work_keys_str_mv AT chihsunglai conceptseparationbymanifoldlearningapproach
AT làizhìsōng conceptseparationbymanifoldlearningapproach
_version_ 1719093931318181888