Automatic Chinese Wordset Generation nd Chinese Spelling Check
碩士 === 國立中正大學 === 資訊工程研究所 === 83 === Spelling Check is a frequently used function in English word processing applications. But for Chinese, this function is not easy to do, because, first, there are no word delimiters in Chinese writing, the sentences are composed by cotinuing Chinese characters....
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
1995
|
Online Access: | http://ndltd.ncl.edu.tw/handle/83857287657094746290 |
id |
ndltd-TW-083CCU03392023 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-083CCU033920232016-02-08T04:06:37Z http://ndltd.ncl.edu.tw/handle/83857287657094746290 Automatic Chinese Wordset Generation nd Chinese Spelling Check 中文詞集自動產生及中文拼字檢查 Liu, Meng-Dar 劉孟達 碩士 國立中正大學 資訊工程研究所 83 Spelling Check is a frequently used function in English word processing applications. But for Chinese, this function is not easy to do, because, first, there are no word delimiters in Chinese writing, the sentences are composed by cotinuing Chinese characters. Second, many Chinese characters can be used as single-character words. Third, the characters can compose words with different character number. Besides, some short words can be composed to be the long words. All these features will bring some difficulties in Chinese word processing. Thus, in spelling checking, we need to process word identification at the same time. We use dictionary lookup and long-word-first policy to identify words. The completeness of the wordbase (dictionary) will influence the correction rate of spelling check. So we use the relationship of characters in the articles to judge whether the character string can compose a word. By using this method to collect these words which are not in general wordbase, such as name of people, name of place and some terminologies, this method can help users create their own wordbase to improve the performance of spelling check. Another topic of this project is the correction of spelling errors. Because many users use phonetic input-method which may cause many homophonic errors, we developed a method to correct this kind of errors. Wu, Sun 吳昇 1995 學位論文 ; thesis 56 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中正大學 === 資訊工程研究所 === 83 === Spelling Check is a frequently used function in English word processing applications. But for Chinese, this function is not easy to do, because, first, there are no word delimiters in Chinese writing, the sentences are composed by cotinuing Chinese characters. Second, many Chinese characters can be used as single-character words. Third, the characters can compose words with different character number. Besides, some short words can be composed to be the long words. All these features will bring some difficulties in Chinese word processing.
Thus, in spelling checking, we need to process word identification at the same time. We use dictionary lookup and long-word-first policy to identify words. The completeness of the wordbase (dictionary) will influence the correction rate of spelling check. So we use the relationship of characters in the articles to judge whether the character string can compose a word. By using this method to collect these words which are not in general wordbase, such as name of people, name of place and some terminologies, this method can help users create their own wordbase to improve the performance of spelling check.
Another topic of this project is the correction of spelling errors. Because many users use phonetic input-method which may cause many homophonic errors, we developed a method to correct this kind of errors.
|
author2 |
Wu, Sun |
author_facet |
Wu, Sun Liu, Meng-Dar 劉孟達 |
author |
Liu, Meng-Dar 劉孟達 |
spellingShingle |
Liu, Meng-Dar 劉孟達 Automatic Chinese Wordset Generation nd Chinese Spelling Check |
author_sort |
Liu, Meng-Dar |
title |
Automatic Chinese Wordset Generation nd Chinese Spelling Check |
title_short |
Automatic Chinese Wordset Generation nd Chinese Spelling Check |
title_full |
Automatic Chinese Wordset Generation nd Chinese Spelling Check |
title_fullStr |
Automatic Chinese Wordset Generation nd Chinese Spelling Check |
title_full_unstemmed |
Automatic Chinese Wordset Generation nd Chinese Spelling Check |
title_sort |
automatic chinese wordset generation nd chinese spelling check |
publishDate |
1995 |
url |
http://ndltd.ncl.edu.tw/handle/83857287657094746290 |
work_keys_str_mv |
AT liumengdar automaticchinesewordsetgenerationndchinesespellingcheck AT liúmèngdá automaticchinesewordsetgenerationndchinesespellingcheck AT liumengdar zhōngwéncíjízìdòngchǎnshēngjízhōngwénpīnzìjiǎnchá AT liúmèngdá zhōngwéncíjízìdòngchǎnshēngjízhōngwénpīnzìjiǎnchá |
_version_ |
1718183087193456640 |