Learning to identify emotional voices

Reduced abilities in talker identification are observed when listeners are presented with the voices of familiar talkers while in an unfamiliar emotional tone. Despite the acoustic variations caused by different emotional states, listeners tend to demonstrate extraordinary abilities in matching the...

Full description

Bibliographic Details
Main Author:	Shen, Lue
Other Authors:	Perrachione, Tyler
Language:	en_US
Published:	2021
Subjects:	Speech therapy
Online Access:	https://hdl.handle.net/2144/41924

id	ndltd-bu.edu-oai-open.bu.edu-2144-41924
record_format	oai_dc
spelling	ndltd-bu.edu-oai-open.bu.edu-2144-419242021-01-29T05:01:13Z Learning to identify emotional voices Shen, Lue Perrachione, Tyler Speech therapy Reduced abilities in talker identification are observed when listeners are presented with the voices of familiar talkers while in an unfamiliar emotional tone. Despite the acoustic variations caused by different emotional states, listeners tend to demonstrate extraordinary abilities in matching the voices to their corresponding talkers, which suggests their perceptual constancies of voice-identity correspondence. However, the distinctive acoustic-perceptual correlates contributing to the formation of perceptual constancy have been rarely studied from the aspect of within-talker variability (i.e., how do listeners know they are hearing the same talkers when many of the key acoustic features of their voices are inconsistent across different contexts?). This study investigated the influence of variation in the emotional tone of voice on listeners’ abilities for talker identification. We explicitly trained our participants with five voices in an emotional state and tested their generalization abilities in talker identification by presenting them with the same talkers’ voices in the trained emotional state and the other untrained emotional state. Our results showed that listeners were more accurate when they were presented with trained emotional states than untrained emotional states. The improved accuracy supported that listeners benefitted from their early experience with the voices across multiple contexts to form their constant perceptual representations of voice identities. We also observed a significant correlation between the difference of mean fundamental frequency (f0) and accuracy, suggesting the differences of mean f0 tended to be a distinctive parameter to quantify the invariant features in the vocal signals. 2021-01-27T18:14:17Z 2021-01-27T18:14:17Z 2021 2021-01-22T23:05:06Z Thesis/Dissertation https://hdl.handle.net/2144/41924 0000-0001-8485-3278 en_US
collection	NDLTD
language	en_US
sources	NDLTD
topic	Speech therapy
spellingShingle	Speech therapy Shen, Lue Learning to identify emotional voices
description	Reduced abilities in talker identification are observed when listeners are presented with the voices of familiar talkers while in an unfamiliar emotional tone. Despite the acoustic variations caused by different emotional states, listeners tend to demonstrate extraordinary abilities in matching the voices to their corresponding talkers, which suggests their perceptual constancies of voice-identity correspondence. However, the distinctive acoustic-perceptual correlates contributing to the formation of perceptual constancy have been rarely studied from the aspect of within-talker variability (i.e., how do listeners know they are hearing the same talkers when many of the key acoustic features of their voices are inconsistent across different contexts?). This study investigated the influence of variation in the emotional tone of voice on listeners’ abilities for talker identification. We explicitly trained our participants with five voices in an emotional state and tested their generalization abilities in talker identification by presenting them with the same talkers’ voices in the trained emotional state and the other untrained emotional state. Our results showed that listeners were more accurate when they were presented with trained emotional states than untrained emotional states. The improved accuracy supported that listeners benefitted from their early experience with the voices across multiple contexts to form their constant perceptual representations of voice identities. We also observed a significant correlation between the difference of mean fundamental frequency (f0) and accuracy, suggesting the differences of mean f0 tended to be a distinctive parameter to quantify the invariant features in the vocal signals.
author2	Perrachione, Tyler
author_facet	Perrachione, Tyler Shen, Lue
author	Shen, Lue
author_sort	Shen, Lue
title	Learning to identify emotional voices
title_short	Learning to identify emotional voices
title_full	Learning to identify emotional voices
title_fullStr	Learning to identify emotional voices
title_full_unstemmed	Learning to identify emotional voices
title_sort	learning to identify emotional voices
publishDate	2021
url	https://hdl.handle.net/2144/41924
work_keys_str_mv	AT shenlue learningtoidentifyemotionalvoices
_version_	1719374769865883648

Learning to identify emotional voices

Similar Items