Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk

Speech technologies such as text-to-speech synthesis (TTS) and automatic speech recognition (ASR) have recently generated much interest in the developed world as a user-interface medium to smartphones [1, 2]. However, it is also recognised that these technologies may potentially have a positive impa...

Full description

Bibliographic Details
Main Author:	Van Niekerk, Daniel Rudolph
Language:	en
Published:	North West University 2015
Subjects:	Speech synthesis Text-to-speech Intonation model Target approximation Tone language Yorùbá Under-resourced languages
Online Access:	http://hdl.handle.net/10394/13054

id	ndltd-netd.ac.za-oai-union.ndltd.org-nwu-oai-dspace.nwu.ac.za-10394-13054
record_format	oai_dc
spelling	ndltd-netd.ac.za-oai-union.ndltd.org-nwu-oai-dspace.nwu.ac.za-10394-130542016-03-16T03:59:07ZTone realisation for speech synthesis of Yorùbá / Daniel Rudolph van NiekerkVan Niekerk, Daniel RudolphSpeech synthesisText-to-speechIntonation modelTarget approximationTone languageYorùbáUnder-resourced languagesSpeech technologies such as text-to-speech synthesis (TTS) and automatic speech recognition (ASR) have recently generated much interest in the developed world as a user-interface medium to smartphones [1, 2]. However, it is also recognised that these technologies may potentially have a positive impact on the lives of those in the developing world, especially in Africa, by presenting an important medium for access to information where illiteracy and a lack of infrastructure play a limiting role [3, 4, 5, 6]. While these technologies continually experience important advances that keep extending their applicability to new and under-resourced languages, one particular area in need of further development is speech synthesis of African tone languages [7, 8]. The main objective of this work is acoustic modelling and synthesis of tone for an African tone,language: Yorùbá. We present an empirical investigation to establish the acoustic properties of tone in Yorùbá, and to evaluate resulting models integrated into a Hidden Markov model-based (HMMbased) TTS system. We show that in Yorùbá, which is considered a register tone language, the realisation of tone is not solely determined by pitch levels, but also inter-syllable and intra-syllable pitch dynamics. Furthermore, our experimental results indicate that utterance-wide pitch patterns are not only a result of cumulative local pitch changes (terracing), but do contain a significant gradual declination component. Lastly, models based on inter- and intra-syllable pitch dynamics using underlying linear pitch targets are shown to be relatively efficient and perceptually preferable to the current standard approach in statistical parametric speech synthesis employing HMM pitch models based on context-dependent phones. These findings support the applicability of the proposed models in under-resourced conditions.PhD (Information Technology), North-West University, Vaal Triangle Campus, 2014North West University2015-01-28T08:01:27Z2015-01-28T08:01:27Z2014Thesishttp://hdl.handle.net/10394/13054en
collection	NDLTD
language	en
sources	NDLTD
topic	Speech synthesis Text-to-speech Intonation model Target approximation Tone language Yorùbá Under-resourced languages
spellingShingle	Speech synthesis Text-to-speech Intonation model Target approximation Tone language Yorùbá Under-resourced languages Van Niekerk, Daniel Rudolph Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
description	Speech technologies such as text-to-speech synthesis (TTS) and automatic speech recognition (ASR) have recently generated much interest in the developed world as a user-interface medium to smartphones [1, 2]. However, it is also recognised that these technologies may potentially have a positive impact on the lives of those in the developing world, especially in Africa, by presenting an important medium for access to information where illiteracy and a lack of infrastructure play a limiting role [3, 4, 5, 6]. While these technologies continually experience important advances that keep extending their applicability to new and under-resourced languages, one particular area in need of further development is speech synthesis of African tone languages [7, 8]. The main objective of this work is acoustic modelling and synthesis of tone for an African tone,language: Yorùbá. We present an empirical investigation to establish the acoustic properties of tone in Yorùbá, and to evaluate resulting models integrated into a Hidden Markov model-based (HMMbased) TTS system. We show that in Yorùbá, which is considered a register tone language, the realisation of tone is not solely determined by pitch levels, but also inter-syllable and intra-syllable pitch dynamics. Furthermore, our experimental results indicate that utterance-wide pitch patterns are not only a result of cumulative local pitch changes (terracing), but do contain a significant gradual declination component. Lastly, models based on inter- and intra-syllable pitch dynamics using underlying linear pitch targets are shown to be relatively efficient and perceptually preferable to the current standard approach in statistical parametric speech synthesis employing HMM pitch models based on context-dependent phones. These findings support the applicability of the proposed models in under-resourced conditions. === PhD (Information Technology), North-West University, Vaal Triangle Campus, 2014
author	Van Niekerk, Daniel Rudolph
author_facet	Van Niekerk, Daniel Rudolph
author_sort	Van Niekerk, Daniel Rudolph
title	Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
title_short	Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
title_full	Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
title_fullStr	Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
title_full_unstemmed	Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk
title_sort	tone realisation for speech synthesis of yorùbá / daniel rudolph van niekerk
publisher	North West University
publishDate	2015
url	http://hdl.handle.net/10394/13054
work_keys_str_mv	AT vanniekerkdanielrudolph tonerealisationforspeechsynthesisofyorubadanielrudolphvanniekerk
_version_	1718204849395335168

Tone realisation for speech synthesis of Yorùbá / Daniel Rudolph van Niekerk

Similar Items