Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model

碩士 === 國立清華大學 === 資訊工程學系 === 90 === WordNet is a lexical database, which organizes English nouns, verbs, adjectives and adverbs according to word sense and relationship between senses. It has been applied increasingly to many knowledge-based NLP tasks as main lexical resource, because of...

Full description

Bibliographic Details
Main Authors: ChingTing Hsieh, 謝靜婷
Other Authors: Jason S. Chang
Format: Others
Language:en_US
Published: 2002
Online Access:http://ndltd.ncl.edu.tw/handle/57916890806827068811
id ndltd-TW-090NTHU0392059
record_format oai_dc
spelling ndltd-TW-090NTHU03920592015-10-13T10:34:06Z http://ndltd.ncl.edu.tw/handle/57916890806827068811 Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model 半自動建立中文WordNet之研究 ChingTing Hsieh 謝靜婷 碩士 國立清華大學 資訊工程學系 90 WordNet is a lexical database, which organizes English nouns, verbs, adjectives and adverbs according to word sense and relationship between senses. It has been applied increasingly to many knowledge-based NLP tasks as main lexical resource, because of it wide-coverage semantic and conceptual information. WordNets for many European languages other then English are being developed in recent years. This paper proposes an approach to semi-automatic construction of Chinese WordNet using a class-based statistical model. Our approach to the problem of constructing Chinese WordNet is via translation of English WordNet. The main problem we have to tackle is to select the appropriate word translation for each word sense. We observe that English words for a common concept tend to have common Chinese characters in their translations. Our method consists of 1) classifying English words into several semantic classes and 2) building a class-based statistical model for estimating word translation probabilities. We have carried out experiments on handling nouns in the WordNet and evaluate our results based on coverage and recall rate. The evaluation shows our approach can achieve 76.43% coverage. The recall rate is 70%, 80% and 90% when top 1, top 2, and top 3 translations are used respectively. Jason S. Chang 張俊盛 2002 學位論文 ; thesis 78 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 90 === WordNet is a lexical database, which organizes English nouns, verbs, adjectives and adverbs according to word sense and relationship between senses. It has been applied increasingly to many knowledge-based NLP tasks as main lexical resource, because of it wide-coverage semantic and conceptual information. WordNets for many European languages other then English are being developed in recent years. This paper proposes an approach to semi-automatic construction of Chinese WordNet using a class-based statistical model. Our approach to the problem of constructing Chinese WordNet is via translation of English WordNet. The main problem we have to tackle is to select the appropriate word translation for each word sense. We observe that English words for a common concept tend to have common Chinese characters in their translations. Our method consists of 1) classifying English words into several semantic classes and 2) building a class-based statistical model for estimating word translation probabilities. We have carried out experiments on handling nouns in the WordNet and evaluate our results based on coverage and recall rate. The evaluation shows our approach can achieve 76.43% coverage. The recall rate is 70%, 80% and 90% when top 1, top 2, and top 3 translations are used respectively.
author2 Jason S. Chang
author_facet Jason S. Chang
ChingTing Hsieh
謝靜婷
author ChingTing Hsieh
謝靜婷
spellingShingle ChingTing Hsieh
謝靜婷
Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
author_sort ChingTing Hsieh
title Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
title_short Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
title_full Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
title_fullStr Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
title_full_unstemmed Semi-Automatic Construction of Chinese WordNet -- Using Class-based Translation Model
title_sort semi-automatic construction of chinese wordnet -- using class-based translation model
publishDate 2002
url http://ndltd.ncl.edu.tw/handle/57916890806827068811
work_keys_str_mv AT chingtinghsieh semiautomaticconstructionofchinesewordnetusingclassbasedtranslationmodel
AT xièjìngtíng semiautomaticconstructionofchinesewordnetusingclassbasedtranslationmodel
AT chingtinghsieh bànzìdòngjiànlìzhōngwénwordnetzhīyánjiū
AT xièjìngtíng bànzìdòngjiànlìzhōngwénwordnetzhīyánjiū
_version_ 1716829140970110976