Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.

When acquiring language, young children may use acoustic spectro-temporal patterns in speech to derive phonological units in spoken language (e.g., prosodic stress patterns, syllables, phonemes). Children appear to learn acoustic-phonological mappings rapidly, without direct instruction, yet the und...

Full description

Bibliographic Details
Main Authors: Victoria Leong, Usha Goswami
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4671555?pdf=render
id doaj-a58c4f9c9a914151a0f54dc7553b27c8
record_format Article
spelling doaj-a58c4f9c9a914151a0f54dc7553b27c82020-11-25T02:47:44ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011012e014441110.1371/journal.pone.0144411Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.Victoria LeongUsha GoswamiWhen acquiring language, young children may use acoustic spectro-temporal patterns in speech to derive phonological units in spoken language (e.g., prosodic stress patterns, syllables, phonemes). Children appear to learn acoustic-phonological mappings rapidly, without direct instruction, yet the underlying developmental mechanisms remain unclear. Across different languages, a relationship between amplitude envelope sensitivity and phonological development has been found, suggesting that children may make use of amplitude modulation (AM) patterns within the envelope to develop a phonological system. Here we present the Spectral Amplitude Modulation Phase Hierarchy (S-AMPH) model, a set of algorithms for deriving the dominant AM patterns in child-directed speech (CDS). Using Principal Components Analysis, we show that rhythmic CDS contains an AM hierarchy comprising 3 core modulation timescales. These timescales correspond to key phonological units: prosodic stress (Stress AM, ~2 Hz), syllables (Syllable AM, ~5 Hz) and onset-rime units (Phoneme AM, ~20 Hz). We argue that these AM patterns could in principle be used by naïve listeners to compute acoustic-phonological mappings without lexical knowledge. We then demonstrate that the modulation statistics within this AM hierarchy indeed parse the speech signal into a primitive hierarchically-organised phonological system comprising stress feet (proto-words), syllables and onset-rime units. We apply the S-AMPH model to two other CDS corpora, one spontaneous and one deliberately-timed. The model accurately identified 72-82% (freely-read CDS) and 90-98% (rhythmically-regular CDS) stress patterns, syllables and onset-rime units. This in-principle demonstration that primitive phonology can be extracted from speech AMs is termed Acoustic-Emergent Phonology (AEP) theory. AEP theory provides a set of methods for examining how early phonological development is shaped by the temporal modulation structure of speech across languages. The S-AMPH model reveals a crucial developmental role for stress feet (AMs ~2 Hz). Stress feet underpin different linguistic rhythm typologies, and speech rhythm underpins language acquisition by infants in all languages.http://europepmc.org/articles/PMC4671555?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Victoria Leong
Usha Goswami
spellingShingle Victoria Leong
Usha Goswami
Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
PLoS ONE
author_facet Victoria Leong
Usha Goswami
author_sort Victoria Leong
title Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
title_short Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
title_full Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
title_fullStr Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
title_full_unstemmed Acoustic-Emergent Phonology in the Amplitude Envelope of Child-Directed Speech.
title_sort acoustic-emergent phonology in the amplitude envelope of child-directed speech.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description When acquiring language, young children may use acoustic spectro-temporal patterns in speech to derive phonological units in spoken language (e.g., prosodic stress patterns, syllables, phonemes). Children appear to learn acoustic-phonological mappings rapidly, without direct instruction, yet the underlying developmental mechanisms remain unclear. Across different languages, a relationship between amplitude envelope sensitivity and phonological development has been found, suggesting that children may make use of amplitude modulation (AM) patterns within the envelope to develop a phonological system. Here we present the Spectral Amplitude Modulation Phase Hierarchy (S-AMPH) model, a set of algorithms for deriving the dominant AM patterns in child-directed speech (CDS). Using Principal Components Analysis, we show that rhythmic CDS contains an AM hierarchy comprising 3 core modulation timescales. These timescales correspond to key phonological units: prosodic stress (Stress AM, ~2 Hz), syllables (Syllable AM, ~5 Hz) and onset-rime units (Phoneme AM, ~20 Hz). We argue that these AM patterns could in principle be used by naïve listeners to compute acoustic-phonological mappings without lexical knowledge. We then demonstrate that the modulation statistics within this AM hierarchy indeed parse the speech signal into a primitive hierarchically-organised phonological system comprising stress feet (proto-words), syllables and onset-rime units. We apply the S-AMPH model to two other CDS corpora, one spontaneous and one deliberately-timed. The model accurately identified 72-82% (freely-read CDS) and 90-98% (rhythmically-regular CDS) stress patterns, syllables and onset-rime units. This in-principle demonstration that primitive phonology can be extracted from speech AMs is termed Acoustic-Emergent Phonology (AEP) theory. AEP theory provides a set of methods for examining how early phonological development is shaped by the temporal modulation structure of speech across languages. The S-AMPH model reveals a crucial developmental role for stress feet (AMs ~2 Hz). Stress feet underpin different linguistic rhythm typologies, and speech rhythm underpins language acquisition by infants in all languages.
url http://europepmc.org/articles/PMC4671555?pdf=render
work_keys_str_mv AT victorialeong acousticemergentphonologyintheamplitudeenvelopeofchilddirectedspeech
AT ushagoswami acousticemergentphonologyintheamplitudeenvelopeofchilddirectedspeech
_version_ 1724751744111476736