Computational models of coherence for open-domain dialogue

Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in...

Full description

Bibliographic Details
Main Author:	Cervone, Alessandra
Other Authors:	Riccardi, Giuseppe
Format:	Doctoral Thesis
Language:	English
Published:	Università degli studi di Trento 2020
Subjects:	Coherence Dialogue Discourse Conversational AI Natural Language Processing
Online Access:	http://hdl.handle.net/11572/276165

id	ndltd-unitn.it-oai-iris.unitn.it-11572-276165
record_format	oai_dc
spelling	ndltd-unitn.it-oai-iris.unitn.it-11572-2761652021-01-21T05:22:17Z Computational models of coherence for open-domain dialogue Cervone, Alessandra Riccardi, Giuseppe Coherence Dialogue Discourse Conversational AI Natural Language Processing Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical. In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation. The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting. 2020-10-08 info:eu-repo/semantics/doctoralThesis http://hdl.handle.net/11572/276165 10.15168/11572_276165 info:eu-repo/semantics/altIdentifier/hdl/11572/276165 eng firstpage:1 lastpage:152 numberofpages:152 info:eu-repo/semantics/openAccess Università degli studi di Trento place:Trento
collection	NDLTD
language	English
format	Doctoral Thesis
sources	NDLTD
topic	Coherence Dialogue Discourse Conversational AI Natural Language Processing
spellingShingle	Coherence Dialogue Discourse Conversational AI Natural Language Processing Cervone, Alessandra Computational models of coherence for open-domain dialogue
description	Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical. In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation. The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting.
author2	Riccardi, Giuseppe
author_facet	Riccardi, Giuseppe Cervone, Alessandra
author	Cervone, Alessandra
author_sort	Cervone, Alessandra
title	Computational models of coherence for open-domain dialogue
title_short	Computational models of coherence for open-domain dialogue
title_full	Computational models of coherence for open-domain dialogue
title_fullStr	Computational models of coherence for open-domain dialogue
title_full_unstemmed	Computational models of coherence for open-domain dialogue
title_sort	computational models of coherence for open-domain dialogue
publisher	Università degli studi di Trento
publishDate	2020
url	http://hdl.handle.net/11572/276165
work_keys_str_mv	AT cervonealessandra computationalmodelsofcoherenceforopendomaindialogue
_version_	1719373613660897280

Computational models of coherence for open-domain dialogue

Similar Items