Perception of incongruent audiovisual English consonants.

Causal inference-the process of deciding whether two incoming signals come from the same source-is an important step in audiovisual (AV) speech perception. This research explored causal inference and perception of incongruent AV English consonants. Nine adults were presented auditory, visual, congru...

Full description

Bibliographic Details
Main Authors:	Kaylah Lalonde, Lynne A Werner
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2019-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0213588

id	doaj-5fc453245ea444938be4225824b5b070
record_format	Article
spelling	doaj-5fc453245ea444938be4225824b5b0702021-03-03T21:40:46ZengPublic Library of Science (PLoS)PLoS ONE1932-62032019-01-01143e021358810.1371/journal.pone.0213588Perception of incongruent audiovisual English consonants.Kaylah LalondeLynne A WernerCausal inference-the process of deciding whether two incoming signals come from the same source-is an important step in audiovisual (AV) speech perception. This research explored causal inference and perception of incongruent AV English consonants. Nine adults were presented auditory, visual, congruent AV, and incongruent AV consonant-vowel syllables. Incongruent AV stimuli included auditory and visual syllables with matched vowels, but mismatched consonants. Open-set responses were collected. For most incongruent syllables, participants were aware of the mismatch between auditory and visual signals (59.04%) or reported the auditory syllable (33.73%). Otherwise, participants reported the visual syllable (1.13%) or some other syllable (6.11%). Statistical analyses were used to assess whether visual distinctiveness and place, voice, and manner features predicted responses. Mismatch responses occurred more when the auditory and visual consonants were visually distinct, when place and manner differed across auditory and visual consonants, and for consonants with high visual accuracy. Auditory responses occurred more when the auditory and visual consonants were visually similar, when place and manner were the same across auditory and visual stimuli, and with consonants produced further back in the mouth. Visual responses occurred more when voicing and manner were the same across auditory and visual stimuli, and for front and middle consonants. Other responses were variable, but typically matched the visual place, auditory voice, and auditory manner of the input. Overall, results indicate that causal inference and incongruent AV consonant perception depend on salience and reliability of auditory and visual inputs and degree of redundancy between auditory and visual inputs. A parameter-free computational model of incongruent AV speech perception based on unimodal confusions, with a causal inference rule, was applied. Data from the current study present an opportunity to test and improve the generalizability of current AV speech integration models.https://doi.org/10.1371/journal.pone.0213588
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Kaylah Lalonde Lynne A Werner
spellingShingle	Kaylah Lalonde Lynne A Werner Perception of incongruent audiovisual English consonants. PLoS ONE
author_facet	Kaylah Lalonde Lynne A Werner
author_sort	Kaylah Lalonde
title	Perception of incongruent audiovisual English consonants.
title_short	Perception of incongruent audiovisual English consonants.
title_full	Perception of incongruent audiovisual English consonants.
title_fullStr	Perception of incongruent audiovisual English consonants.
title_full_unstemmed	Perception of incongruent audiovisual English consonants.
title_sort	perception of incongruent audiovisual english consonants.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2019-01-01
description	Causal inference-the process of deciding whether two incoming signals come from the same source-is an important step in audiovisual (AV) speech perception. This research explored causal inference and perception of incongruent AV English consonants. Nine adults were presented auditory, visual, congruent AV, and incongruent AV consonant-vowel syllables. Incongruent AV stimuli included auditory and visual syllables with matched vowels, but mismatched consonants. Open-set responses were collected. For most incongruent syllables, participants were aware of the mismatch between auditory and visual signals (59.04%) or reported the auditory syllable (33.73%). Otherwise, participants reported the visual syllable (1.13%) or some other syllable (6.11%). Statistical analyses were used to assess whether visual distinctiveness and place, voice, and manner features predicted responses. Mismatch responses occurred more when the auditory and visual consonants were visually distinct, when place and manner differed across auditory and visual consonants, and for consonants with high visual accuracy. Auditory responses occurred more when the auditory and visual consonants were visually similar, when place and manner were the same across auditory and visual stimuli, and with consonants produced further back in the mouth. Visual responses occurred more when voicing and manner were the same across auditory and visual stimuli, and for front and middle consonants. Other responses were variable, but typically matched the visual place, auditory voice, and auditory manner of the input. Overall, results indicate that causal inference and incongruent AV consonant perception depend on salience and reliability of auditory and visual inputs and degree of redundancy between auditory and visual inputs. A parameter-free computational model of incongruent AV speech perception based on unimodal confusions, with a causal inference rule, was applied. Data from the current study present an opportunity to test and improve the generalizability of current AV speech integration models.
url	https://doi.org/10.1371/journal.pone.0213588
work_keys_str_mv	AT kaylahlalonde perceptionofincongruentaudiovisualenglishconsonants AT lynneawerner perceptionofincongruentaudiovisualenglishconsonants
_version_	1714815635133825024

Perception of incongruent audiovisual English consonants.

Similar Items