Návrh souboru pravidel pro analýzu anafor v českém jazyce (Proposal of a set of rules for anaphora analysis in the Czech language)

Bibliographic Details
Main Author: Nguy, Giang Linh
Other Authors: Hajičová, Eva
Format: Dissertation
Language: Czech
Published: 2006
Online Access: http://www.nusl.cz/ntk/nusl-269487
Description
Summary: With the increasing importance of natural language processing, a growing body of research addresses automatic anaphora resolution. This thesis contributes to that research. The aim of the work is to propose a set of rules for anaphora resolution in Czech. The resulting rule set consists of handwritten rules as well as rules developed with the aid of the machine learning system C4.5. Annotated data from the Prague Dependency Treebank were used to train and test the rules. The treebank captures the following types of anaphora: pronominal anaphora, control, reciprocity, and the dependency relation of adjuncts. Our work focuses on these types of anaphora. The rules are evaluated using the standard measures of recall and precision.
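
The summary does not say how recall and precision were computed in the thesis. As a minimal sketch of the usual definitions only: assume each resolved anaphor is represented as an (anaphor, antecedent) pair, and that system output is compared against gold annotation by exact pair match. The function name `precision_recall` and the toy data are hypothetical and are not taken from the thesis or from the Prague Dependency Treebank.

```python
def precision_recall(predicted, gold):
    """Return (precision, recall) for predicted anaphoric links.

    Assumption: a link counts as correct only if the same
    (anaphor, antecedent) pair appears in the gold annotation.
    """
    predicted, gold = set(predicted), set(gold)
    correct = len(predicted & gold)
    # Precision: share of proposed links that are correct.
    precision = correct / len(predicted) if predicted else 0.0
    # Recall: share of gold links that the system found.
    recall = correct / len(gold) if gold else 0.0
    return precision, recall


# Toy data, invented for illustration.
gold_links = {("he", "Peter"), ("it", "book"), ("her", "Mary")}
predicted_links = {("he", "Peter"), ("it", "table")}

p, r = precision_recall(predicted_links, gold_links)
print(f"precision={p:.2f} recall={r:.2f}")
```

Here one of the two proposed links is correct, and one of the three gold links is recovered, so precision and recall differ. Reporting both matters because a rule set can raise one at the expense of the other.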