Temporal variability of fundamental frequency contours
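Intonation is one of the means of performing a speech style. Thus, observing pitch variation in an utterance may be a clue to identifying speech style. We design a cumulative slope (CS) index based upon the amount of pitch variation in a measured F0 contour and the duration of that contour. The more pitch changes there are and the greater their frequency range is, the greater the CS index is. This is confirmed by an experiment we conduct: the CS index of utterances with expressive intonation is higher than that of utterances with neutral intonation, and for utterances with neutral intonation the CS index is higher than for utterances with monotonous or flat intonation. However, as there is great variability between speakers, the CS index as currently defined cannot be used to universally differentiate between the styles. Results obtained using automatic voice activity detection (VAD) are close to those obtained with manual VAD, and thus the extraction of the CS index can be reliably automated.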
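The abstract describes the CS index only in words: it grows with the amount of pitch movement in an F0 contour and is related to the contour's duration. The record does not give the authors' formula, so the following is a minimal sketch under a stated assumption: total absolute pitch movement in semitones divided by the contour duration in seconds. The function name `cumulative_slope`, the semitone conversion, and the reference frequency `ref_hz` are all hypothetical choices made for illustration, and the input is assumed to contain voiced frames only.

```python
import math


def cumulative_slope(times, f0_hz, ref_hz=100.0):
    """Illustrative CS-like index (ASSUMED formula, not the authors' definition).

    times  -- frame times in seconds, strictly increasing
    f0_hz  -- F0 values in Hz for voiced frames only (all > 0)
    ref_hz -- hypothetical reference frequency for the semitone scale
    Returns the summed absolute pitch change in semitones per second.
    """
    if len(times) != len(f0_hz):
        raise ValueError("times and f0_hz must have the same length")
    if len(times) < 2:
        return 0.0  # a single point has no slope
    if any(f <= 0 for f in f0_hz):
        raise ValueError("f0_hz must contain only voiced (positive) values")

    duration = times[-1] - times[0]
    if duration <= 0:
        raise ValueError("times must be strictly increasing")

    # Convert Hz to semitones so that equal musical intervals count equally.
    semitones = [12.0 * math.log2(f / ref_hz) for f in f0_hz]

    # Accumulate the size of every pitch step, regardless of direction.
    total_change = sum(abs(b - a) for a, b in zip(semitones, semitones[1:]))
    return total_change / duration


# Toy contours over one second: flat, mildly varied, strongly varied.
t = [0.0, 0.5, 1.0]
flat = cumulative_slope(t, [120.0, 120.0, 120.0])
neutral = cumulative_slope(t, [120.0, 130.0, 120.0])
expressive = cumulative_slope(t, [120.0, 180.0, 120.0])
```

On these toy contours the ordering matches the trend reported in the abstract (expressive above neutral above flat), but only because the assumed formula was built to reward more and larger pitch changes; it says nothing about the between-speaker variability the authors report.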
Main Authors: | Robin Hruška, Tomáš Bořil |
---|---|
Format: | Article |
Language: | ces |
Published: | Karolinum Press, 2017-09-01 |
Series: | Acta Universitatis Carolinae: Philologica |
Subjects: | fundamental frequency, melody of speech, stylization, variability of pitch contours |
Online Access: | http://www.karolinum.cz/doi/10.14712/24646830.2017.31 |
id |
doaj-4c71fb7487db428dbe1842e986eaba05 |
---|---|
record_format |
Article |
spelling |
doaj-4c71fb7487db428dbe1842e986eaba05; 2020-11-25T03:15:41Z; ces; Karolinum Press; Acta Universitatis Carolinae: Philologica; 0567-8269; 2464-6830; 2017-09-01; 201733544; 10.14712/24646830.2017.31; 5263; Temporal variability of fundamental frequency contours; Robin Hruška; Tomáš Bořil; Intonation is one of the means of performing a speech style. Thus, observing pitch variation in an utterance may be a clue to identifying speech style. We design a cumulative slope (CS) index based upon the amount of pitch variation in a measured F0 contour and the duration of that contour. The more pitch changes there are and the greater their frequency range is, the greater the CS index is. This is confirmed by an experiment we conduct: the CS index of utterances with expressive intonation is higher than that of utterances with neutral intonation, and for utterances with neutral intonation the CS index is higher than for utterances with monotonous or flat intonation. However, as there is a great variability between speakers, the CS index as defined currently, cannot be used to universally differentiate between the styles. Results obtained using automatic voice activity detection (VAD) are close to those obtained with manual VAD and thus the extraction of CS index can be reliably automatized.; http://www.karolinum.cz/doi/10.14712/24646830.2017.31; fundamental frequency; melody of speech; stylization; variability of pitch contours |
collection |
DOAJ |
language |
ces |
format |
Article |
sources |
DOAJ |
author |
Robin Hruška Tomáš Bořil |
spellingShingle |
Robin Hruška Tomáš Bořil Temporal variability of fundamental frequency contours Acta Universitatis Carolinae: Philologica fundamental frequency melody of speech stylization variability of pitch contours |
author_facet |
Robin Hruška Tomáš Bořil |
author_sort |
Robin Hruška |
title |
Temporal variability of fundamental frequency contours |
title_short |
Temporal variability of fundamental frequency contours |
title_full |
Temporal variability of fundamental frequency contours |
title_fullStr |
Temporal variability of fundamental frequency contours |
title_full_unstemmed |
Temporal variability of fundamental frequency contours |
title_sort |
temporal variability of fundamental frequency contours |
publisher |
Karolinum Press |
series |
Acta Universitatis Carolinae: Philologica |
issn |
0567-8269 2464-6830 |
publishDate |
2017-09-01 |
description |
Intonation is one of the means of performing a speech style. Thus, observing pitch variation in an utterance may be a clue to identifying speech style. We design a cumulative slope (CS) index based upon the amount of pitch variation in a measured F0 contour and the duration of that contour. The more pitch changes there are and the greater their frequency range is, the greater the CS index is. This is confirmed by an experiment we conduct: the CS index of utterances with expressive intonation is higher than that of utterances with neutral intonation, and for utterances with neutral intonation the CS index is higher than for utterances with monotonous or flat intonation. However, as there is great variability between speakers, the CS index as currently defined cannot be used to universally differentiate between the styles. Results obtained using automatic voice activity detection (VAD) are close to those obtained with manual VAD, and thus the extraction of the CS index can be reliably automated. |
topic |
fundamental frequency melody of speech stylization variability of pitch contours |
url |
http://www.karolinum.cz/doi/10.14712/24646830.2017.31 |
work_keys_str_mv |
AT robinhruska temporalvariabilityoffundamentalfrequencycontours AT tomasboril temporalvariabilityoffundamentalfrequencycontours |
_version_ |
1724638147676995584 |