Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus

Master's === Yuan Ze University === Department of Computer Science and Engineering === 97 === We observe that the number of Web pages is unevenly distributed across languages. For example, the ODP directory contains a large number of English Web pages but only a relatively small number of Chinese and Korean Web pages. Ho...

Bibliographic Details
Main Authors: Cheng-Yu Chen, 陳政瑜
Other Authors: Cheng-Zen Yang
Format: Others
Language: zh-TW
Published: 2009
Online Access: http://ndltd.ncl.edu.tw/handle/49294977700724640062
id ndltd-TW-097YZU05392063
record_format oai_dc
spelling ndltd-TW-097YZU05392063 2016-05-04T04:17:10Z http://ndltd.ncl.edu.tw/handle/49294977700724640062 Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus 以跨語言階層索引典輔助網頁目錄自動化建構 Cheng-Yu Chen 陳政瑜 Master's Yuan Ze University Department of Computer Science and Engineering 97 We observe that the number of Web pages is unevenly distributed across languages. For example, the ODP directory contains a large number of English Web pages but only a relatively small number of Chinese and Korean Web pages. However, some other Web taxonomies contain many more Chinese and Korean Web pages than ODP does. We therefore propose to use these abundant Web resources to enrich the content of the non-English ODP taxonomies. Because the non-English ODP directories contain few Web pages, we use the English ODP directory as an external hierarchical thesaurus to aid their construction. This external cross-lingual hierarchical thesaurus is employed in a hierarchical catalog integration scheme to construct non-English Web taxonomies. Our experiments show that the cross-lingual hierarchical thesaurus improves construction performance. Cheng-Zen Yang 楊正仁 2009 thesis 33 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Yuan Ze University === Department of Computer Science and Engineering === 97 === We observe that the number of Web pages is unevenly distributed across languages. For example, the ODP directory contains a large number of English Web pages but only a relatively small number of Chinese and Korean Web pages. However, some other Web taxonomies contain many more Chinese and Korean Web pages than ODP does. We therefore propose to use these abundant Web resources to enrich the content of the non-English ODP taxonomies. Because the non-English ODP directories contain few Web pages, we use the English ODP directory as an external hierarchical thesaurus to aid their construction. This external cross-lingual hierarchical thesaurus is employed in a hierarchical catalog integration scheme to construct non-English Web taxonomies. Our experiments show that the cross-lingual hierarchical thesaurus improves construction performance.
author2 Cheng-Zen Yang
author_facet Cheng-Zen Yang
Cheng-Yu Chen
陳政瑜
author Cheng-Yu Chen
陳政瑜
spellingShingle Cheng-Yu Chen
陳政瑜
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus
author_sort Cheng-Yu Chen
title Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/49294977700724640062