Design and Implementation of a Treebank Development Tool

Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 84 === Syntactically analyzed corpora carry rich linguistic information that can be used in lexicography and in building speech recognition systems, statistical translation systems, etc. Before taking the adva...


Bibliographic Details
Main Authors: Shaw,Min-Shin, 蕭敏信
Other Authors: Chen,Hsin-Hsi
Format: Others
Language: zh-TW
Published: 1996
Online Access: http://ndltd.ncl.edu.tw/handle/10766675651666895390
id ndltd-TW-084NTU00392054
record_format oai_dc
spelling ndltd-TW-084NTU00392054 2016-07-13T04:10:50Z http://ndltd.ncl.edu.tw/handle/10766675651666895390 Design and Implementation of a Treebank Development Tool 樹狀語料庫發展工具之設計與製作 Shaw,Min-Shin 蕭敏信 Master's National Taiwan University Graduate Institute of Computer Science and Information Engineering 84 Syntactically analyzed corpora carry rich linguistic information that can be used in lexicography and in building speech recognition systems, statistical translation systems, etc. Before such a corpus can be exploited, a large-scale, high-quality corpus must be constructed. In this thesis, we describe the major problem in constructing the NTU corpus, namely inconsistency, explain how to find possible inconsistencies across the whole corpus, and report the performance of this detection. We also describe the techniques used to develop a Chinese treebank development system. Setting the main constituent, marking uncertain constituents, and similar operations can be used to improve the quality of the candidate parse trees. The graphical interface shows the structural outline of a parse tree, and the structure of the tree is easy to modify. The whole system was developed on MS-Windows. Chen,Hsin-Hsi 陳信希 1996 thesis 74 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 84 === Syntactically analyzed corpora carry rich linguistic information that can be used in lexicography and in building speech recognition systems, statistical translation systems, etc. Before such a corpus can be exploited, a large-scale, high-quality corpus must be constructed. In this thesis, we describe the major problem in constructing the NTU corpus, namely inconsistency, explain how to find possible inconsistencies across the whole corpus, and report the performance of this detection. We also describe the techniques used to develop a Chinese treebank development system. Setting the main constituent, marking uncertain constituents, and similar operations can be used to improve the quality of the candidate parse trees. The graphical interface shows the structural outline of a parse tree, and the structure of the tree is easy to modify. The whole system was developed on MS-Windows.