Computational models of coherence for open-domain dialogue

Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in...

Full description

Bibliographic Details
Main Author: Cervone, Alessandra
Other Authors: Riccardi, Giuseppe
Format: Doctoral Thesis
Language:English
Published: Università degli studi di Trento 2020
Subjects:
Online Access:http://hdl.handle.net/11572/276165
id ndltd-unitn.it-oai-iris.unitn.it-11572-276165
record_format oai_dc
spelling ndltd-unitn.it-oai-iris.unitn.it-11572-2761652021-01-21T05:22:17Z Computational models of coherence for open-domain dialogue Cervone, Alessandra Riccardi, Giuseppe Coherence Dialogue Discourse Conversational AI Natural Language Processing Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical. In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation. The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting. 2020-10-08 info:eu-repo/semantics/doctoralThesis http://hdl.handle.net/11572/276165 10.15168/11572_276165 info:eu-repo/semantics/altIdentifier/hdl/11572/276165 eng firstpage:1 lastpage:152 numberofpages:152 info:eu-repo/semantics/openAccess Università degli studi di Trento place:Trento
collection NDLTD
language English
format Doctoral Thesis
sources NDLTD
topic Coherence Dialogue Discourse Conversational AI Natural Language Processing
spellingShingle Coherence Dialogue Discourse Conversational AI Natural Language Processing
Cervone, Alessandra
Computational models of coherence for open-domain dialogue
description Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical. In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation. The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting.
author2 Riccardi, Giuseppe
author_facet Riccardi, Giuseppe
Cervone, Alessandra
author Cervone, Alessandra
author_sort Cervone, Alessandra
title Computational models of coherence for open-domain dialogue
title_short Computational models of coherence for open-domain dialogue
title_full Computational models of coherence for open-domain dialogue
title_fullStr Computational models of coherence for open-domain dialogue
title_full_unstemmed Computational models of coherence for open-domain dialogue
title_sort computational models of coherence for open-domain dialogue
publisher Università degli studi di Trento
publishDate 2020
url http://hdl.handle.net/11572/276165
work_keys_str_mv AT cervonealessandra computationalmodelsofcoherenceforopendomaindialogue
_version_ 1719373613660897280