Hierarchical Catalog Integrate based on the Maximum Entropy Model

碩士 === 元智大學 === 資訊工程學系 === 95 === In many areas, information is organized in catalogs on the Web. Demands of integration two catalogs appear in many applications. These catalogs usually contain a lot of Web documents and have complicated hierarchical structures. Therefore, how to integrate two catal...

Full description

Bibliographic Details
Main Authors: Cheng-Tse Hung, 洪誠澤
Other Authors: Cheng-Zen Yang
Format: Others
Language:zh-TW
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/14842545761557971657
id ndltd-TW-095YZU05392030
record_format oai_dc
spelling ndltd-TW-095YZU053920302016-05-23T04:17:53Z http://ndltd.ncl.edu.tw/handle/14842545761557971657 Hierarchical Catalog Integrate based on the Maximum Entropy Model 基於最大熵模型的階層式目錄整合之研究 Cheng-Tse Hung 洪誠澤 碩士 元智大學 資訊工程學系 95 In many areas, information is organized in catalogs on the Web. Demands of integration two catalogs appear in many applications. These catalogs usually contain a lot of Web documents and have complicated hierarchical structures. Therefore, how to integrate two catalogs accurately becomes an important research topic. For the catalog integration problem, past studies mainly focus on flattened catalogs, and only few papers further discuss the integration of hierarchical catalogs. To the best of our survey, no research has discussed the improvement from additional semantic information on hierarchical catalog integration. This thesis presents an enhancement based on the Maximum Entropy (ME) model using the hierarchical thesaurus information embedded in the catalogs and the additional semantic features expanded from an external corpus. Experimental results on real-world catalogs indicate that the proposed approach consistently improves the integration performance. Cheng-Zen Yang 楊正仁 2007 學位論文 ; thesis 47 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊工程學系 === 95 === In many areas, information is organized in catalogs on the Web. Demands of integration two catalogs appear in many applications. These catalogs usually contain a lot of Web documents and have complicated hierarchical structures. Therefore, how to integrate two catalogs accurately becomes an important research topic. For the catalog integration problem, past studies mainly focus on flattened catalogs, and only few papers further discuss the integration of hierarchical catalogs. To the best of our survey, no research has discussed the improvement from additional semantic information on hierarchical catalog integration. This thesis presents an enhancement based on the Maximum Entropy (ME) model using the hierarchical thesaurus information embedded in the catalogs and the additional semantic features expanded from an external corpus. Experimental results on real-world catalogs indicate that the proposed approach consistently improves the integration performance.
author2 Cheng-Zen Yang
author_facet Cheng-Zen Yang
Cheng-Tse Hung
洪誠澤
author Cheng-Tse Hung
洪誠澤
spellingShingle Cheng-Tse Hung
洪誠澤
Hierarchical Catalog Integrate based on the Maximum Entropy Model
author_sort Cheng-Tse Hung
title Hierarchical Catalog Integrate based on the Maximum Entropy Model
title_short Hierarchical Catalog Integrate based on the Maximum Entropy Model
title_full Hierarchical Catalog Integrate based on the Maximum Entropy Model
title_fullStr Hierarchical Catalog Integrate based on the Maximum Entropy Model
title_full_unstemmed Hierarchical Catalog Integrate based on the Maximum Entropy Model
title_sort hierarchical catalog integrate based on the maximum entropy model
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/14842545761557971657
work_keys_str_mv AT chengtsehung hierarchicalcatalogintegratebasedonthemaximumentropymodel
AT hóngchéngzé hierarchicalcatalogintegratebasedonthemaximumentropymodel
AT chengtsehung jīyúzuìdàshāngmóxíngdejiēcéngshìmùlùzhěnghézhīyánjiū
AT hóngchéngzé jīyúzuìdàshāngmóxíngdejiēcéngshìmùlùzhěnghézhīyánjiū
_version_ 1718278825634168832