Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue

Entrainment (sometimes called adaptation or alignment) is the tendency of human speakers to adapt to or imitate characteristics of their interlocutors' behavior. This work focuses on entrainment on acoustic-prosodic features. Acoustic-prosodic entrainment has been extensively studied but is not...

Full description

Bibliographic Details
Main Author: Levitan, Rivka
Language:English
Published: 2014
Subjects:
Online Access:https://doi.org/10.7916/D8GT5KCH
id ndltd-columbia.edu-oai-academiccommons.columbia.edu-10.7916-D8GT5KCH
record_format oai_dc
spelling ndltd-columbia.edu-oai-academiccommons.columbia.edu-10.7916-D8GT5KCH2019-05-09T15:14:33ZAcoustic-Prosodic Entrainment in Human-Human and Human-Computer DialogueLevitan, Rivka2014ThesesComputer scienceHuman-computer interactionSocial interactionLanguage and languages--RhythmEntrainment (sometimes called adaptation or alignment) is the tendency of human speakers to adapt to or imitate characteristics of their interlocutors' behavior. This work focuses on entrainment on acoustic-prosodic features. Acoustic-prosodic entrainment has been extensively studied but is not well understood. In particular, it is difficult to compare the results of different studies, since entrainment is usually measured in different ways, reflect- ing disparate conceptualizations of the phenomenon. In the first part of this thesis, we look for evidence of entrainment on a variety of acoustic-prosodic features according to various conceptualizations, and show that human speakers of both Standard American English and Mandarin Chinese entrain to each other globally and locally, in synchrony, and that this entrainment can be constant or convergent. We explore the relationship between entrainment and gender and show that entrainment on some acoustic-prosodic features is related to social behavior and dialogue coordination. In addition, we show that humans entrain in a novel domain, backchannel-inviting cues, and propose and test a novel hypothesis: that entrainment will be stronger in the case of an outlier feature value. In the second part of the thesis, we describe a method for flexibly and dynamically entraining a TTS voice to multiple acoustic-prosodic features of a user's input utterances, and show in an exploratory study that users prefer an entraining avatar to one that does not entrain, are more likely to ask its advice, and choose more positive adjectives to describe its voice. This work introduces a coherent view of entrainment in both familiar and novel domains. Our results add to the body of knowledge of entrainment in human-human conversations and propose new directions for making use of that knowledge to enhance human-computer interactions.Englishhttps://doi.org/10.7916/D8GT5KCH
collection NDLTD
language English
sources NDLTD
topic Computer science
Human-computer interaction
Social interaction
Language and languages--Rhythm
spellingShingle Computer science
Human-computer interaction
Social interaction
Language and languages--Rhythm
Levitan, Rivka
Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
description Entrainment (sometimes called adaptation or alignment) is the tendency of human speakers to adapt to or imitate characteristics of their interlocutors' behavior. This work focuses on entrainment on acoustic-prosodic features. Acoustic-prosodic entrainment has been extensively studied but is not well understood. In particular, it is difficult to compare the results of different studies, since entrainment is usually measured in different ways, reflect- ing disparate conceptualizations of the phenomenon. In the first part of this thesis, we look for evidence of entrainment on a variety of acoustic-prosodic features according to various conceptualizations, and show that human speakers of both Standard American English and Mandarin Chinese entrain to each other globally and locally, in synchrony, and that this entrainment can be constant or convergent. We explore the relationship between entrainment and gender and show that entrainment on some acoustic-prosodic features is related to social behavior and dialogue coordination. In addition, we show that humans entrain in a novel domain, backchannel-inviting cues, and propose and test a novel hypothesis: that entrainment will be stronger in the case of an outlier feature value. In the second part of the thesis, we describe a method for flexibly and dynamically entraining a TTS voice to multiple acoustic-prosodic features of a user's input utterances, and show in an exploratory study that users prefer an entraining avatar to one that does not entrain, are more likely to ask its advice, and choose more positive adjectives to describe its voice. This work introduces a coherent view of entrainment in both familiar and novel domains. Our results add to the body of knowledge of entrainment in human-human conversations and propose new directions for making use of that knowledge to enhance human-computer interactions.
author Levitan, Rivka
author_facet Levitan, Rivka
author_sort Levitan, Rivka
title Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
title_short Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
title_full Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
title_fullStr Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
title_full_unstemmed Acoustic-Prosodic Entrainment in Human-Human and Human-Computer Dialogue
title_sort acoustic-prosodic entrainment in human-human and human-computer dialogue
publishDate 2014
url https://doi.org/10.7916/D8GT5KCH
work_keys_str_mv AT levitanrivka acousticprosodicentrainmentinhumanhumanandhumancomputerdialogue
_version_ 1719045925437964288