Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus
碩士 === 元智大學 === 資訊工程學系 === 97 === In our observations, we find that the inequality problem exists in the amount of Web pages of different languages. For example, the ODP directory contains a large number of English Web pages, but only has a relatively small number of Chinese and Korean Web pages. Ho...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/49294977700724640062 |
id |
ndltd-TW-097YZU05392063 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097YZU053920632016-05-04T04:17:10Z http://ndltd.ncl.edu.tw/handle/49294977700724640062 Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus 以跨語言階層索引典輔助網頁目錄自動化建構 Cheng-Yu Chen 陳政瑜 碩士 元智大學 資訊工程學系 97 In our observations, we find that the inequality problem exists in the amount of Web pages of different languages. For example, the ODP directory contains a large number of English Web pages, but only has a relatively small number of Chinese and Korean Web pages. However, some Web taxonomies actually contain many Chinese and Korean Web pages than ODP. Therefore, we plan to use these abundant Web resources to fertilize the content of non-English ODP taxonomies. Since non-English ODP directories have rare Web pages, we utilize English ODP directory as an external hierarchical thesaurus to help the construction of non-English ODP directories. The external cross-lingual hierarchical thesaurus has been employed in a hierarchical catalog integration scheme to construct non-English Web taxonomies. As shown in our experiments, the construction performance is therefore improved with the cross-lingual hierarchical thesaurus. Cheng-Zen Yang 楊正仁 2009 學位論文 ; thesis 33 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊工程學系 === 97 === In our observations, we find that the inequality problem exists in the amount of Web pages of different languages.
For example, the ODP directory contains a large number of English Web pages, but only has a relatively small number of Chinese and Korean Web pages.
However, some Web taxonomies actually contain many Chinese and Korean Web pages than ODP.
Therefore, we plan to use these abundant Web resources to fertilize the content of non-English ODP taxonomies.
Since non-English ODP directories have rare Web pages,
we utilize English ODP directory as an external hierarchical thesaurus to help the construction of non-English ODP directories.
The external cross-lingual hierarchical thesaurus has been employed in a hierarchical catalog integration scheme to construct non-English Web taxonomies.
As shown in our experiments, the construction performance is therefore improved with the cross-lingual hierarchical thesaurus.
|
author2 |
Cheng-Zen Yang |
author_facet |
Cheng-Zen Yang Cheng-Yu Chen 陳政瑜 |
author |
Cheng-Yu Chen 陳政瑜 |
spellingShingle |
Cheng-Yu Chen 陳政瑜 Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
author_sort |
Cheng-Yu Chen |
title |
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
title_short |
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
title_full |
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
title_fullStr |
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
title_full_unstemmed |
Web Taxonomy Construction using a Cross-lingual Hierarchical Thesaurus |
title_sort |
web taxonomy construction using a cross-lingual hierarchical thesaurus |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/49294977700724640062 |
work_keys_str_mv |
AT chengyuchen webtaxonomyconstructionusingacrosslingualhierarchicalthesaurus AT chénzhèngyú webtaxonomyconstructionusingacrosslingualhierarchicalthesaurus AT chengyuchen yǐkuàyǔyánjiēcéngsuǒyǐndiǎnfǔzhùwǎngyèmùlùzìdònghuàjiàngòu AT chénzhèngyú yǐkuàyǔyánjiēcéngsuǒyǐndiǎnfǔzhùwǎngyèmùlùzìdònghuàjiàngòu |
_version_ |
1718256539786018816 |