Tree Transformations in Inductive Dependency Parsing

This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to...

Full description

Bibliographic Details
Main Author:	Nilsson, Jens
Format:	Others
Language:	English
Published:	Växjö universitet, Matematiska och systemtekniska institutionen 2007
Subjects:	Inductive Dependency Parsing Dependency Structure Tree Transformation Non-projectivity Coordination Verb Group Language Technology (Computational Linguistics) Språkteknologi (språkvetenskaplig databehandling)
Online Access:	http://urn.kb.se/resolve?urn=urn:nbn:se:vxu:diva-1205

id	ndltd-UPSALLA1-oai-DiVA.org-vxu-1205
record_format	oai_dc
spelling	ndltd-UPSALLA1-oai-DiVA.org-vxu-12052018-01-14T05:09:30ZTree Transformations in Inductive Dependency ParsingengNilsson, JensVäxjö universitet, Matematiska och systemtekniska institutionenVäxjö : Matematiska och systemtekniska institutionen2007Inductive Dependency ParsingDependency StructureTree TransformationNon-projectivityCoordinationVerb GroupLanguage Technology (Computational Linguistics)Språkteknologi (språkvetenskaplig databehandling)This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to improve the parsing accuracy. Several studies using constituency-based theories for natural languages in such automatic and data-driven syntactic parsing have shown that training data, annotated according to a linguistic theory, often needs to be adapted in various ways in order to achieve an adequate, automatic analysis. A linguistically sound constituent structure is not necessarily well-suited for learning and parsing using existing data-driven methods. Modifications to the constituency-based trees in the training data, and corresponding modifications to the parser output, have successfully been applied to increase the parser accuracy. The topic of this thesis is to investigate whether similar modifications in the form of tree transformations to training data, annotated with dependency-based structures, can improve accuracy for data-driven dependency parsers. In order to do this, two types of tree transformations are in focus in this thesis. %This is a topic that so far has been less studied. The first one concerns non-projectivity. The full potential of dependency parsing can only be realized if non-projective constructions are allowed, which pose a problem for projective dependency parsers. On the other hand, non-projective parsers tend, among other things, to be slower. In order to maintain the benefits of projective parsing, a tree transformation technique to recover non-projectivity while using a projective parser is presented here. The second type of transformation concerns linguistic phenomena that are possible but hard for a parser to learn, given a certain choice of dependency analysis. This study has concentrated on two such phenomena, coordination and verb groups, for which tree transformations are applied in order to improve parsing accuracy, in case the original structure does not coincide with a structure that is easy to learn. Empirical evaluations are performed using treebank data from various languages, and using more than one dependency parser. The results show that the benefit of these tree transformations used in preprocessing and postprocessing to a large extent is language, treebank and parser independent. Licentiate thesis, monographinfo:eu-repo/semantics/masterThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:vxu:diva-1205Rapporter från MSI, 1650-2647 ; 07002application/pdfinfo:eu-repo/semantics/openAccess
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Inductive Dependency Parsing Dependency Structure Tree Transformation Non-projectivity Coordination Verb Group Language Technology (Computational Linguistics) Språkteknologi (språkvetenskaplig databehandling)
spellingShingle	Inductive Dependency Parsing Dependency Structure Tree Transformation Non-projectivity Coordination Verb Group Language Technology (Computational Linguistics) Språkteknologi (språkvetenskaplig databehandling) Nilsson, Jens Tree Transformations in Inductive Dependency Parsing
description	This licentiate thesis deals with automatic syntactic analysis, or parsing, of natural languages. A parser constructs the syntactic analysis, which it learns by looking at correctly analyzed sentences, known as training data. The general topic concerns manipulations of the training data in order to improve the parsing accuracy. Several studies using constituency-based theories for natural languages in such automatic and data-driven syntactic parsing have shown that training data, annotated according to a linguistic theory, often needs to be adapted in various ways in order to achieve an adequate, automatic analysis. A linguistically sound constituent structure is not necessarily well-suited for learning and parsing using existing data-driven methods. Modifications to the constituency-based trees in the training data, and corresponding modifications to the parser output, have successfully been applied to increase the parser accuracy. The topic of this thesis is to investigate whether similar modifications in the form of tree transformations to training data, annotated with dependency-based structures, can improve accuracy for data-driven dependency parsers. In order to do this, two types of tree transformations are in focus in this thesis. %This is a topic that so far has been less studied. The first one concerns non-projectivity. The full potential of dependency parsing can only be realized if non-projective constructions are allowed, which pose a problem for projective dependency parsers. On the other hand, non-projective parsers tend, among other things, to be slower. In order to maintain the benefits of projective parsing, a tree transformation technique to recover non-projectivity while using a projective parser is presented here. The second type of transformation concerns linguistic phenomena that are possible but hard for a parser to learn, given a certain choice of dependency analysis. This study has concentrated on two such phenomena, coordination and verb groups, for which tree transformations are applied in order to improve parsing accuracy, in case the original structure does not coincide with a structure that is easy to learn. Empirical evaluations are performed using treebank data from various languages, and using more than one dependency parser. The results show that the benefit of these tree transformations used in preprocessing and postprocessing to a large extent is language, treebank and parser independent.
author	Nilsson, Jens
author_facet	Nilsson, Jens
author_sort	Nilsson, Jens
title	Tree Transformations in Inductive Dependency Parsing
title_short	Tree Transformations in Inductive Dependency Parsing
title_full	Tree Transformations in Inductive Dependency Parsing
title_fullStr	Tree Transformations in Inductive Dependency Parsing
title_full_unstemmed	Tree Transformations in Inductive Dependency Parsing
title_sort	tree transformations in inductive dependency parsing
publisher	Växjö universitet, Matematiska och systemtekniska institutionen
publishDate	2007
url	http://urn.kb.se/resolve?urn=urn:nbn:se:vxu:diva-1205
work_keys_str_mv	AT nilssonjens treetransformationsininductivedependencyparsing
_version_	1718609098029662208

Tree Transformations in Inductive Dependency Parsing

Similar Items