Design and Implementation of a Treebank Development Tool
Master's thesis === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 84 === Syntactically analyzed corpora carry a great deal of linguistic information and can be used in lexicography and in building speech recognition systems, statistical translation systems, etc. Before taking the adva...
Main Authors: | Shaw,Min-Shin 蕭敏信 |
---|---|
Other Authors: | Chen,Hsin-Hsi |
Format: | Others |
Language: | zh-TW |
Published: | 1996 |
Online Access: | http://ndltd.ncl.edu.tw/handle/10766675651666895390 |
id |
ndltd-TW-084NTU00392054 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-084NTU00392054 2016-07-13T04:10:50Z http://ndltd.ncl.edu.tw/handle/10766675651666895390 Design and Implementation of a Treebank Development Tool 樹狀語料庫發展工具之設計與製作 Shaw,Min-Shin 蕭敏信 Master's thesis National Taiwan University Graduate Institute of Computer Science and Information Engineering 84 Syntactically analyzed corpora carry a great deal of linguistic information and can be used in lexicography and in building speech recognition systems, statistical translation systems, etc. Before such corpora can be exploited, it is important to construct a large-scale, high-quality corpus. In this thesis, we describe the major problem in constructing the NTU corpus, namely inconsistency; how to find possible inconsistencies across the whole corpus; and how well this detection performs. We also describe the techniques used to develop a Chinese treebank development system. Setting main constituents, marking uncertain constituents, etc. can be used to improve the quality of the possible parse trees. The graphical interface shows an outline of the parse tree's structure, and the structure is also easy to modify. The whole system is developed on MS-Windows. Chen,Hsin-Hsi 陳信希 1996 degree thesis ; thesis 74 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's thesis === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 84 === Syntactically analyzed corpora carry a great deal of linguistic
information and can be used in lexicography and in building speech
recognition systems, statistical translation systems, etc. Before
such corpora can be exploited, it is important to construct a
large-scale, high-quality corpus.
In this thesis, we describe the major problem in constructing
the NTU corpus, namely inconsistency; how to find possible
inconsistencies across the whole corpus; and how well this
detection performs. We also describe the techniques used to
develop a Chinese treebank development system. Setting main
constituents, marking uncertain constituents, etc. can be used to
improve the quality of the possible parse trees. The graphical
interface shows an outline of the parse tree's structure, and
the structure is also easy to modify. The whole
system is developed on MS-Windows.
|
author2 |
Chen,Hsin-Hsi |
author_facet |
Chen,Hsin-Hsi Shaw,Min-Shin 蕭敏信 |
author |
Shaw,Min-Shin 蕭敏信 |
author_sort |
Shaw,Min-Shin |
title |
Design and Implementation of a Treebank Development Tool |
publishDate |
1996 |
url |
http://ndltd.ncl.edu.tw/handle/10766675651666895390 |