Acoustic classification of focus: On the web and in the lab

We present a new methodological approach which combines both naturally-occurring speech harvested on the web and speech data elicited in the laboratory. This proof-of-concept study examines the phenomenon of focus sensitivity in English, in which the interpretation of particular grammatical construc...

Full description

Bibliographic Details
Main Authors: Jonathan Howell, Mats Rooth, Michael Wagner
Format: Article
Language:English
Published: Open Library of Humanities 2017-07-01
Series:Laboratory Phonology
Subjects:
Online Access:https://www.journal-labphon.org/articles/8
id doaj-ea1287d4a8e84935998a2b668ca2bf96
record_format Article
spelling doaj-ea1287d4a8e84935998a2b668ca2bf962021-10-02T07:53:33ZengOpen Library of HumanitiesLaboratory Phonology1868-63541868-63542017-07-018110.5334/labphon.838Acoustic classification of focus: On the web and in the labJonathan Howell0Mats Rooth1Michael Wagner2Montclair State UniversityCornell UniversityMcGill UniversityWe present a new methodological approach which combines both naturally-occurring speech harvested on the web and speech data elicited in the laboratory. This proof-of-concept study examines the phenomenon of focus sensitivity in English, in which the interpretation of particular grammatical constructions (e.g., the comparative) is sensitive to the location of prosodic prominence. Machine learning algorithms (support vector machines and linear discriminant analysis) and human perception experiments are used to cross-validate the web-harvested and lab-elicited speech. Results confirm the theoretical predictions for location of prominence in comparative clauses and the advantages using both web-harvested and lab-elicited speech. The most robust acoustic classifiers include paradigmatic (i.e., un-normalized), non-intonational acoustic measures (duration and relative formant frequencies from single segments). These acoustic cues are also significant predictors of human listeners’ classification, offering new evidence in the debate whether prominence is mainly encoded by pitch or by other cues, and the role that utterance-normalization plays when looking at non-pitch cues such as duration.https://www.journal-labphon.org/articles/8prosodymachine learningweb as corpusfocusprominenceacoustic classification
collection DOAJ
language English
format Article
sources DOAJ
author Jonathan Howell
Mats Rooth
Michael Wagner
spellingShingle Jonathan Howell
Mats Rooth
Michael Wagner
Acoustic classification of focus: On the web and in the lab
Laboratory Phonology
prosody
machine learning
web as corpus
focus
prominence
acoustic classification
author_facet Jonathan Howell
Mats Rooth
Michael Wagner
author_sort Jonathan Howell
title Acoustic classification of focus: On the web and in the lab
title_short Acoustic classification of focus: On the web and in the lab
title_full Acoustic classification of focus: On the web and in the lab
title_fullStr Acoustic classification of focus: On the web and in the lab
title_full_unstemmed Acoustic classification of focus: On the web and in the lab
title_sort acoustic classification of focus: on the web and in the lab
publisher Open Library of Humanities
series Laboratory Phonology
issn 1868-6354
1868-6354
publishDate 2017-07-01
description We present a new methodological approach which combines both naturally-occurring speech harvested on the web and speech data elicited in the laboratory. This proof-of-concept study examines the phenomenon of focus sensitivity in English, in which the interpretation of particular grammatical constructions (e.g., the comparative) is sensitive to the location of prosodic prominence. Machine learning algorithms (support vector machines and linear discriminant analysis) and human perception experiments are used to cross-validate the web-harvested and lab-elicited speech. Results confirm the theoretical predictions for location of prominence in comparative clauses and the advantages using both web-harvested and lab-elicited speech. The most robust acoustic classifiers include paradigmatic (i.e., un-normalized), non-intonational acoustic measures (duration and relative formant frequencies from single segments). These acoustic cues are also significant predictors of human listeners’ classification, offering new evidence in the debate whether prominence is mainly encoded by pitch or by other cues, and the role that utterance-normalization plays when looking at non-pitch cues such as duration.
topic prosody
machine learning
web as corpus
focus
prominence
acoustic classification
url https://www.journal-labphon.org/articles/8
work_keys_str_mv AT jonathanhowell acousticclassificationoffocusonthewebandinthelab
AT matsrooth acousticclassificationoffocusonthewebandinthelab
AT michaelwagner acousticclassificationoffocusonthewebandinthelab
_version_ 1716857332574453760