Computational models of coherence for open-domain dialogue
Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in...
Main Author: | |
---|---|
Other Authors: | |
Format: | Doctoral Thesis |
Language: | English |
Published: |
Università degli studi di Trento
2020
|
Subjects: | |
Online Access: | http://hdl.handle.net/11572/276165 |
id |
ndltd-unitn.it-oai-iris.unitn.it-11572-276165 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-unitn.it-oai-iris.unitn.it-11572-2761652021-01-21T05:22:17Z Computational models of coherence for open-domain dialogue Cervone, Alessandra Riccardi, Giuseppe Coherence Dialogue Discourse Conversational AI Natural Language Processing Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical. In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation. The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting. 2020-10-08 info:eu-repo/semantics/doctoralThesis http://hdl.handle.net/11572/276165 10.15168/11572_276165 info:eu-repo/semantics/altIdentifier/hdl/11572/276165 eng firstpage:1 lastpage:152 numberofpages:152 info:eu-repo/semantics/openAccess Università degli studi di Trento place:Trento |
collection |
NDLTD |
language |
English |
format |
Doctoral Thesis |
sources |
NDLTD |
topic |
Coherence
Dialogue
Discourse
Conversational AI
Natural Language Processing |
spellingShingle |
Coherence
Dialogue
Discourse
Conversational AI
Natural Language Processing Cervone, Alessandra Computational models of coherence for open-domain dialogue |
description |
Coherence is the quality that gives a text its conceptual unity, making a text a coordinated set of connected parts rather than a random group of sentences (turns, in the case of dialogue). Hence, coherence is an integral property of human communication, necessary for a meaningful discourse both in text and dialogue. As such, coherence can be regarded as a requirement for conversational agents, i.e. machines designed to converse with humans. Though recently there has been a proliferation in the usage and popularity of conversational agents, dialogue coherence is still a relatively neglected area of research, and coherence across multiple turns of a dialogue remains an open challenge for current conversational AI research. As conversational agents progress from being able to handle a single application domain to multiple ones through any domain (open-domain), the range of possible dialogue paths increases, and thus the problem of maintaining multi-turn coherence becomes especially critical.
In this thesis, we investigate two aspects of coherence in dialogue and how they can be used to design modules for an open-domain coherent conversational agent. In particular, our approach focuses on modeling intentional and thematic information patterns of distribution as proxies for a coherent discourse in open-domain dialogue. While for modeling intentional information we employ Dialogue Acts (DA) theory (Bunt, 2009); for modeling thematic information we rely on open-domain entities (Barzilay and Lapata, 2008). We find that DAs and entities play a fundamental role in modelling dialogue coherence both independently and jointly, and that they can be used to model different components of an open-domain conversational agent architecture, such as Spoken Language Understanding, Dialogue Management, Natural Language Generation, and open-domain dialogue evaluation.
The main contributions of this thesis are: (I) we present an open-domain modular conversational agent architecture based on entity and DA structures designed for coherence and engagement; (II) we propose a methodology for training an open-domain DA tagger compliant with the ISO 24617-2 standard (Bunt et al., 2012) combining multiple resources; (III) we propose different models, and a corpus, for predicting open-domain dialogue coherence using DA and entity information trained with weakly supervised techniques, first at the conversation level and then at the turn level; (IV) we present supervised approaches for automatic evaluation of open-domain conversation exploiting DA and entity information, both at the conversation level and at the turn level; (V) we present experiments with Natural Language Generation models that generate text from Meaning Representation structures composed of DAs and slots for an open-domain setting. |
author2 |
Riccardi, Giuseppe |
author_facet |
Riccardi, Giuseppe Cervone, Alessandra |
author |
Cervone, Alessandra |
author_sort |
Cervone, Alessandra |
title |
Computational models of coherence for open-domain dialogue |
title_short |
Computational models of coherence for open-domain dialogue |
title_full |
Computational models of coherence for open-domain dialogue |
title_fullStr |
Computational models of coherence for open-domain dialogue |
title_full_unstemmed |
Computational models of coherence for open-domain dialogue |
title_sort |
computational models of coherence for open-domain dialogue |
publisher |
Università degli studi di Trento |
publishDate |
2020 |
url |
http://hdl.handle.net/11572/276165 |
work_keys_str_mv |
AT cervonealessandra computationalmodelsofcoherenceforopendomaindialogue |
_version_ |
1719373613660897280 |