Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus
Master's === National Tsing Hua University === Institute of Statistics === 99 === In preparing a Taiwanese-Mandarin parallel corpus written in Langgeh orthography (that is, with spaces between simple short phrases), a previous study explored the tagging of Mandarin simple short phrases. This paper continues the study on tagging Taiwanese simple sho...
Main Author: | 李柏宏 |
---|---|
Other Authors: | 江永進 |
Format: | Others |
Language: | zh-TW |
Published: | 2011 |
Online Access: | http://ndltd.ncl.edu.tw/handle/73663830169897348183 |
id |
ndltd-TW-099NTHU5337005 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099NTHU53370052015-10-13T20:04:06Z http://ndltd.ncl.edu.tw/handle/73663830169897348183 Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus 台華 平行語料中 台語 簡短詞組 的 詞類標記 李柏宏 碩士 國立清華大學 統計學研究所 99 In preparing Taiwanese-Mandarin parallel corpus written in Langgeh orthography (that is, with spaces between simple short phrases), previous study explores the tagging of Mandarin simple short phrases. This paper continues the study on tagging Taiwanese simple short phrases using their Mandarin counterparts. With aim as aid to the semi-automatic tagging process, we emphasize the full correctness of tagging procedures; a partially correct tagging procedure requires manual inspection of the tagging results, an effort no less than full manual tagging. After exploring several possibilities, we come up with a simple tagging procedure that attains 100% correctness in tagging our experimental corpus, although with only 55% recall rate among all Taiwanese simple short phrases. 江永進 2011 學位論文 ; thesis 44 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Tsing Hua University === Institute of Statistics === 99 === In preparing a Taiwanese-Mandarin parallel corpus written in Langgeh orthography (that is, with spaces between simple short phrases), a previous study explored the tagging of Mandarin simple short phrases. This paper continues the study on tagging Taiwanese simple short phrases using their Mandarin counterparts. Because the goal is to aid a semi-automatic tagging process, we emphasize full correctness of the tagging procedure: a partially correct procedure requires manual inspection of its results, an effort no less than full manual tagging. After exploring several possibilities, we arrive at a simple tagging procedure that attains 100% correctness on our experimental corpus, although with a recall rate of only 55% among all Taiwanese simple short phrases. |
author2 |
江永進 |
author_facet |
江永進 李柏宏 |
author |
李柏宏 |
spellingShingle |
李柏宏 Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
author_sort |
李柏宏 |
title |
Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
title_short |
Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
title_full |
Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
title_fullStr |
Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
title_full_unstemmed |
Tagging Simple Short Phrase in Taiwanese Using their Mandarin Counterparts in a Parallel Corpus |
title_sort |
tagging simple short phrase in taiwanese using their mandarin counterparts in a parallel corpus |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/73663830169897348183 |
work_keys_str_mv |
AT lǐbǎihóng taggingsimpleshortphraseintaiwaneseusingtheirmandarincounterpartsinaparallelcorpus AT lǐbǎihóng táihuápíngxíngyǔliàozhōngtáiyǔjiǎnduǎncízǔdecílèibiāojì |
_version_ |
1718043852395249664 |