Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS

博士 === 國立交通大學 === 電信工程研究所 === 105 === A structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech (SC-MTTS) system to a new speaker’s data for producing a...

Full description

Bibliographic Details
Main Authors:	Liao, I-Bin, 廖宜斌
Other Authors:	Chen, Sin-Horng
Format:	Others
Language:	en_US
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/3pp89v

id	ndltd-TW-105NCTU5435032
record_format	oai_dc
spelling	ndltd-TW-105NCTU54350322019-05-15T23:09:04Z http://ndltd.ncl.edu.tw/handle/3pp89v Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS 語速相依韻律模型之語者調適技術與應用 Liao, I-Bin 廖宜斌博士國立交通大學電信工程研究所 105 A structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech (SC-MTTS) system to a new speaker’s data for producing a new voice is discussed. Two main issues are addressed. One is the small SR coverage of the adaptation data and is solved by using the existing SR-HPM which was trained from a speech corpus of wide SR coverage as an informative prior. Another is the data sparseness problem resulting from the large number of parameters of the SR-HPM to be adjusted. It is solved by hierarchically organizing the SR-HPM parameters into decision-trees so as to be efficiently adjusted by the SMAP method. The effectiveness of the proposed approach is evaluated on speech databases of five new speakers. Both objective and subjective evaluations show that the proposed method not only performs better than the maximum likelihood-based method in the observed SR range of the target speaker’s data, but also is much better in the unseen SR ranges. Chen, Sin-Horng 陳信宏 2016 學位論文 ; thesis 79 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 國立交通大學 === 電信工程研究所 === 105 === A structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech (SC-MTTS) system to a new speaker’s data for producing a new voice is discussed. Two main issues are addressed. One is the small SR coverage of the adaptation data and is solved by using the existing SR-HPM which was trained from a speech corpus of wide SR coverage as an informative prior. Another is the data sparseness problem resulting from the large number of parameters of the SR-HPM to be adjusted. It is solved by hierarchically organizing the SR-HPM parameters into decision-trees so as to be efficiently adjusted by the SMAP method. The effectiveness of the proposed approach is evaluated on speech databases of five new speakers. Both objective and subjective evaluations show that the proposed method not only performs better than the maximum likelihood-based method in the observed SR range of the target speaker’s data, but also is much better in the unseen SR ranges.
author2	Chen, Sin-Horng
author_facet	Chen, Sin-Horng Liao, I-Bin 廖宜斌
author	Liao, I-Bin 廖宜斌
spellingShingle	Liao, I-Bin 廖宜斌 Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
author_sort	Liao, I-Bin
title	Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
title_short	Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
title_full	Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
title_fullStr	Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
title_full_unstemmed	Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
title_sort	speaker adaptation of sr-hpm for speaking rate-controlled mandarin tts
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/3pp89v
work_keys_str_mv	AT liaoibin speakeradaptationofsrhpmforspeakingratecontrolledmandarintts AT liàoyíbīn speakeradaptationofsrhpmforspeakingratecontrolledmandarintts AT liaoibin yǔsùxiāngyīyùnlǜmóxíngzhīyǔzhědiàoshìjìshùyǔyīngyòng AT liàoyíbīn yǔsùxiāngyīyùnlǜmóxíngzhīyǔzhědiàoshìjìshùyǔyīngyòng
_version_	1719141045709570048

Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS

Similar Items