Interpretation of anaphoric expressions in the Lolita system

This thesis addresses the issue of anaphora resolution in the large scale natural language system, LOLITA. The work described here involved a thorough analysis of the system’s initial performance, the collection of evidence for and the design of the new anaphora resolution algorithm, and subsequent...

Full description

Bibliographic Details
Main Author: Urbanowicz, Agnieszka Joanna
Published: Durham University 1998
Subjects:
005
Online Access:http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.286464
id ndltd-bl.uk-oai-ethos.bl.uk-286464
record_format oai_dc
spelling ndltd-bl.uk-oai-ethos.bl.uk-2864642015-03-19T05:41:19ZInterpretation of anaphoric expressions in the Lolita systemUrbanowicz, Agnieszka Joanna1998This thesis addresses the issue of anaphora resolution in the large scale natural language system, LOLITA. The work described here involved a thorough analysis of the system’s initial performance, the collection of evidence for and the design of the new anaphora resolution algorithm, and subsequent implementation and evaluation of the system. Anaphoric expressions are elements of a discourse whose resolution depends on other elements of the preceding discourse. The processes involved in anaphora resolution have long been the subject of research in a variety of fields. The changes carried out to LOLITA first involved substantial improvements to the core, lower level modules which form the basis of the system. A major change specific to the interpretation of anaphoric expressions was then introduced. A system of filters, in which potential candidates for resolution are filtered according to a set of heuristics, has been changed to a system of penalties, where candidates accumulate points throughout the application of the heuristics. At the end of the process, the candidate with the smallest penalty is chosen as a referent. New heuristics, motivated by evidence drawn from research in linguistics, psycholinguistics and AI, have been added to the system. The system was evaluated using a procedure similar to that defined by MUC6 (DARPA 1995). Blind and open tests were used. The first evaluation was carried out after the general improvements to the lower level modules; the second after the introduction of the new anaphora algorithm. It was found that the general improvements led to a considerable rise in scores in both the blind and the open test sets. As a result of the anaphora specific improvements, on the other hand, the rise in scores on the open set was larger than the rise on the blind set. In the open set the category of pronouns showed the most marked improvement. It was concluded that it is the work carried out to the basic, lower level modules of a large scale system which leads to biggest gains. It was also concluded that considerable extra advantage can be gained by using the new weights-based algorithm together with the generally improved system.005Natural language engineeringDurham Universityhttp://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.286464http://etheses.dur.ac.uk/5001/Electronic Thesis or Dissertation
collection NDLTD
sources NDLTD
topic 005
Natural language engineering
spellingShingle 005
Natural language engineering
Urbanowicz, Agnieszka Joanna
Interpretation of anaphoric expressions in the Lolita system
description This thesis addresses the issue of anaphora resolution in the large scale natural language system, LOLITA. The work described here involved a thorough analysis of the system’s initial performance, the collection of evidence for and the design of the new anaphora resolution algorithm, and subsequent implementation and evaluation of the system. Anaphoric expressions are elements of a discourse whose resolution depends on other elements of the preceding discourse. The processes involved in anaphora resolution have long been the subject of research in a variety of fields. The changes carried out to LOLITA first involved substantial improvements to the core, lower level modules which form the basis of the system. A major change specific to the interpretation of anaphoric expressions was then introduced. A system of filters, in which potential candidates for resolution are filtered according to a set of heuristics, has been changed to a system of penalties, where candidates accumulate points throughout the application of the heuristics. At the end of the process, the candidate with the smallest penalty is chosen as a referent. New heuristics, motivated by evidence drawn from research in linguistics, psycholinguistics and AI, have been added to the system. The system was evaluated using a procedure similar to that defined by MUC6 (DARPA 1995). Blind and open tests were used. The first evaluation was carried out after the general improvements to the lower level modules; the second after the introduction of the new anaphora algorithm. It was found that the general improvements led to a considerable rise in scores in both the blind and the open test sets. As a result of the anaphora specific improvements, on the other hand, the rise in scores on the open set was larger than the rise on the blind set. In the open set the category of pronouns showed the most marked improvement. It was concluded that it is the work carried out to the basic, lower level modules of a large scale system which leads to biggest gains. It was also concluded that considerable extra advantage can be gained by using the new weights-based algorithm together with the generally improved system.
author Urbanowicz, Agnieszka Joanna
author_facet Urbanowicz, Agnieszka Joanna
author_sort Urbanowicz, Agnieszka Joanna
title Interpretation of anaphoric expressions in the Lolita system
title_short Interpretation of anaphoric expressions in the Lolita system
title_full Interpretation of anaphoric expressions in the Lolita system
title_fullStr Interpretation of anaphoric expressions in the Lolita system
title_full_unstemmed Interpretation of anaphoric expressions in the Lolita system
title_sort interpretation of anaphoric expressions in the lolita system
publisher Durham University
publishDate 1998
url http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.286464
work_keys_str_mv AT urbanowiczagnieszkajoanna interpretationofanaphoricexpressionsinthelolitasystem
_version_ 1716742199464427520