Analytical and Tectogrammatical Analysis of a Natural Language
The thesis presents tools for analysis at the analytical and tectogrammatical layers on which the Prague Dependency Treebank is based. The tools for analytical annotation consist of two parsers and a tool for assigning syntactic tags. Although the performance of the parsers is far below that of the state...
Main Author: | Klimeš, Václav |
---|---|
Other Authors: | Hajič, Jan |
Format: | Doctoral Thesis |
Language: | English |
Published: | 2006 |
Online Access: | http://www.nusl.cz/ntk/nusl-269995 |
id | ndltd-nusl.cz-oai-invenio.nusl.cz-269995 |
---|---|
record_format | oai_dc |
spelling | ndltd-nusl.cz-oai-invenio.nusl.cz-269995 2018-12-10T04:16:11Z Analytical and Tectogrammatical Analysis of a Natural Language Analytical and Tectogrammatical Analysis of a Natural Language Klimeš, Václav Hajič, Jan Pala, Karel Ribarov, Kiril The thesis presents tools for analysis at the analytical and tectogrammatical layers on which the Prague Dependency Treebank is based. The tools for analytical annotation consist of two parsers and a tool for assigning syntactic tags. Although the performance of the parsers is far below that of state-of-the-art parsers, both can be considered a contribution to parsing, since the methods they are based on are novel. The tool for assigning syntactic tags makes 15% fewer errors than a tool previously used for this purpose. The tool developed for tectogrammatical annotation is the only one that can currently perform this task in such breadth. Although other, specialized tools may perform better on some of its particular subtasks, my tool makes 29% and 47% fewer errors for the Czech language than the combination of existing tools for annotating the tectogrammatical structure and deep functors, respectively, which are the core of the tectogrammatical layer. The proposed tools are designed so that they can be used for other languages as well. 2006 info:eu-repo/semantics/doctoralThesis http://www.nusl.cz/ntk/nusl-269995 eng info:eu-repo/semantics/restrictedAccess |
collection | NDLTD |
language | English |
format | Doctoral Thesis |
sources | NDLTD |
description | The thesis presents tools for analysis at the analytical and tectogrammatical layers on which the Prague Dependency Treebank is based. The tools for analytical annotation consist of two parsers and a tool for assigning syntactic tags. Although the performance of the parsers is far below that of state-of-the-art parsers, both can be considered a contribution to parsing, since the methods they are based on are novel. The tool for assigning syntactic tags makes 15% fewer errors than a tool previously used for this purpose. The tool developed for tectogrammatical annotation is the only one that can currently perform this task in such breadth. Although other, specialized tools may perform better on some of its particular subtasks, my tool makes 29% and 47% fewer errors for the Czech language than the combination of existing tools for annotating the tectogrammatical structure and deep functors, respectively, which are the core of the tectogrammatical layer. The proposed tools are designed so that they can be used for other languages as well. |
author2 | Hajič, Jan |
author_facet | Hajič, Jan Klimeš, Václav |
author | Klimeš, Václav |
spellingShingle | Klimeš, Václav Analytical and Tectogrammatical Analysis of a Natural Language |
author_sort | Klimeš, Václav |
title | Analytical and Tectogrammatical Analysis of a Natural Language |
title_short | Analytical and Tectogrammatical Analysis of a Natural Language |
title_full | Analytical and Tectogrammatical Analysis of a Natural Language |
title_fullStr | Analytical and Tectogrammatical Analysis of a Natural Language |
title_full_unstemmed | Analytical and Tectogrammatical Analysis of a Natural Language |
title_sort | analytical and tectogrammatical analysis of a natural language |
publishDate | 2006 |
url | http://www.nusl.cz/ntk/nusl-269995 |
work_keys_str_mv | AT klimesvaclav analyticalandtectogrammaticalanalysisofanaturallanguage |
_version_ | 1718799915795087360 |