Analytical and Tectogrammatical Analysis of a Natural Language

The thesis presents tools for analysis at the analytical and tectogrammatical layers on which the Prague Dependency Treebank is based. The tools for analytical annotation consist of two parsers and a tool for assigning syntactic tags. Although the performance of the parsers is far below that of the state...


Bibliographic Details
Main Author: Klimeš, Václav
Other Authors: Hajič, Jan
Format: Doctoral Thesis
Language: English
Published: 2006
Online Access: http://www.nusl.cz/ntk/nusl-269995
id ndltd-nusl.cz-oai-invenio.nusl.cz-269995
record_format oai_dc
spelling ndltd-nusl.cz-oai-invenio.nusl.cz-269995 2018-12-10T04:16:11Z Analytical and Tectogrammatical Analysis of a Natural Language Klimeš, Václav Hajič, Jan Pala, Karel Ribarov, Kiril 2006 info:eu-repo/semantics/doctoralThesis http://www.nusl.cz/ntk/nusl-269995 eng info:eu-repo/semantics/restrictedAccess
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
description The thesis presents tools for analysis at the analytical and tectogrammatical layers on which the Prague Dependency Treebank is based. The tools for analytical annotation consist of two parsers and a tool for assigning syntactic tags. Although the performance of the parsers is far below that of state-of-the-art parsers, both can be considered a contribution to parsing, since the methods they are based on are novel. The tool for assigning syntactic tags makes 15% fewer errors than the tool previously used for this purpose. The tool developed for tectogrammatical annotation is the only one that can currently perform this task in such breadth. Although other, specialized tools may perform better on some of its particular subtasks, my tool makes 29% and 47% fewer errors for the Czech language than the combination of existing tools for annotating the tectogrammatical structure and deep functors, respectively, which are the core of the tectogrammatical layer. The proposed tools are designed so that they can be used for other languages as well.
author2 Hajič, Jan
author_facet Hajič, Jan
Klimeš, Václav
author Klimeš, Václav
spellingShingle Klimeš, Václav
Analytical and Tectogrammatical Analysis of a Natural Language
author_sort Klimeš, Václav
title Analytical and Tectogrammatical Analysis of a Natural Language
title_sort analytical and tectogrammatical analysis of a natural language
publishDate 2006
url http://www.nusl.cz/ntk/nusl-269995