Prosody analysis and modeling for Cantonese text-to-speech.

Li Yu Jia. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. === Includes bibliographical references. === Abstracts in English and Chinese. === Chapter Chapter 1 --- Introduction --- p.1 === Chapter 1.1. --- TTS Technology --- p.1 === Chapter 1.2. --- Prosody --- p.2 === Chapter 1.2.1....

Full description

Bibliographic Details
Other Authors: Li, Yu Jia.
Format: Others
Language:English
Chinese
Published: 2003
Subjects:
Online Access:http://library.cuhk.edu.hk/record=b5891678
http://repository.lib.cuhk.edu.hk/en/item/cuhk-324229
id ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_324229
record_format oai_dc
collection NDLTD
language English
Chinese
format Others
sources NDLTD
topic Speech processing systems
Prosodic analysis (Linguistics)
Speech synthesis
Cantonese dialects--Data processing
spellingShingle Speech processing systems
Prosodic analysis (Linguistics)
Speech synthesis
Cantonese dialects--Data processing
Prosody analysis and modeling for Cantonese text-to-speech.
description Li Yu Jia. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. === Includes bibliographical references. === Abstracts in English and Chinese. === Chapter Chapter 1 --- Introduction --- p.1 === Chapter 1.1. --- TTS Technology --- p.1 === Chapter 1.2. --- Prosody --- p.2 === Chapter 1.2.1. --- What is Prosody --- p.2 === Chapter 1.2.2. --- Prosody from Different Perspectives --- p.3 === Chapter 1.2.3. --- Acoustical Parameters of Prosody --- p.3 === Chapter 1.2.4. --- Prosody in TTS --- p.5 === Chapter 1.2.4.1 --- Analysis --- p.5 === Chapter 1.2.4.2 --- Modeling --- p.6 === Chapter 1.2.4.3 --- Evaluation --- p.6 === Chapter 1.3. --- Thesis Objectives --- p.7 === Chapter 1.4. --- Thesis Outline --- p.7 === Reference --- p.8 === Chapter Chapter 2 --- Cantonese --- p.9 === Chapter 2.1. --- The Cantonese Dialect --- p.9 === Chapter 2.1.1. --- Phonology --- p.10 === Chapter 2.1.1.1 --- Initial --- p.11 === Chapter 2.1.1.2 --- Final --- p.12 === Chapter 2.1.1.3 --- Tone --- p.13 === Chapter 2.1.2. --- Phonological Constraints --- p.14 === Chapter 2.2. --- Tones in Cantonese --- p.15 === Chapter 2.2.1. --- Tone System --- p.15 === Chapter 2.2.2. --- Linguistic Significance --- p.18 === Chapter 2.2.3. --- Acoustical Realization --- p.18 === Chapter 2.3. --- Prosodic Variation in Continuous Cantonese Speech --- p.20 === Chapter 2.4. --- Cantonese Speech Corpus - CUProsody --- p.21 === Reference --- p.23 === Chapter Chapter 3 --- F0 Normalization --- p.25 === Chapter 3.1. --- F0 in Speech Production --- p.25 === Chapter 3.2. --- F0 Extraction --- p.27 === Chapter 3.3. --- Duration-normalized Tone Contour --- p.29 === Chapter 3.4. --- F0 Normalization --- p.30 === Chapter 3.4.1. --- Necessity and Motivation --- p.30 === Chapter 3.4.2. --- F0 Normalization --- p.33 === Chapter 3.4.2.1 --- Methodology --- p.33 === Chapter 3.4.2.2 --- Assumptions --- p.34 === Chapter 3.4.2.3 --- Estimation of Relative Tone Ratios --- p.35 === Chapter 3.4.2.4 --- Derivation of Phrase Curve --- p.37 === Chapter 3.4.2.5 --- Normalization of Absolute FO Values --- p.39 === Chapter 3.4.3. --- Experiments and Discussion --- p.39 === Chapter 3.5. --- Conclusions --- p.44 === Reference --- p.45 === Chapter Chapter 4 --- Acoustical FO Analysis --- p.48 === Chapter 4.1. --- Methodology of FO Analysis --- p.48 === Chapter 4.1.1. --- Analysis-by-Synthesis --- p.48 === Chapter 4.1.2. --- Acoustical Analysis --- p.51 === Chapter 4.2. --- Acoustical FO Analysis for Cantonese --- p.52 === Chapter 4.2.1. --- Analysis of Phrase Curves --- p.52 === Chapter 4.2.2. --- Analysis of Tone Contours --- p.55 === Chapter 4.2.2.1 --- Context-independent Single-tone Contours --- p.56 === Chapter 4.2.2.2 --- Contextual Variation --- p.58 === Chapter 4.2.2.3 --- Co-articulated Tone Contours of Disyllabic Word --- p.59 === Chapter 4.2.2.4 --- Cross-word Contours --- p.62 === Chapter 4.2.2.5 --- Phrase-initial Tone Contours --- p.65 === Chapter 4.3. --- Summary --- p.66 === Reference --- p.67 === Chapter Chapter5 --- Prosody Modeling for Cantonese Text-to-Speech --- p.70 === Chapter 5.1. --- Parametric Model and Non-parametric Model --- p.70 === Chapter 5.2. --- Cantonese Text-to-Speech: Baseline System --- p.72 === Chapter 5.2.1. --- Sub-syllable Unit --- p.72 === Chapter 5.2.2. --- Text Analysis Module --- p.73 === Chapter 5.2.3. --- Acoustical Synthesis --- p.74 === Chapter 5.2.4. --- Prosody Module --- p.74 === Chapter 5.3. --- Enhanced Prosody Model --- p.74 === Chapter 5.3.1. --- Modeling Tone Contours --- p.75 === Chapter 5.3.1.1 --- Word-level FO Contours --- p.76 === Chapter 5.3.1.2 --- Phrase-initial Tone Contours --- p.77 === Chapter 5.3.1.3 --- Tone Contours at Word Boundary --- p.78 === Chapter 5.3.2. --- Modeling Phrase Curves --- p.79 === Chapter 5.3.3. --- Generation of Continuous FO Contours --- p.81 === Chapter 5.4. --- Summary --- p.81 === Reference --- p.82 === Chapter Chapter 6 --- Performance Evaluation --- p.83 === Chapter 6.1. --- Introduction to Perceptual Test --- p.83 === Chapter 6.1.1. --- Aspects of Evaluation --- p.84 === Chapter 6.1.2. --- Methods of Judgment Test --- p.84 === Chapter 6.1.3. --- Problems in Perceptual Test --- p.85 === Chapter 6.2. --- Perceptual Tests for Cantonese TTS --- p.86 === Chapter 6.2.1. --- Intelligibility Tests --- p.86 === Chapter 6.2.1.1 --- Method --- p.86 === Chapter 6.2.1.2 --- Results --- p.88 === Chapter 6.2.1.3 --- Analysis --- p.89 === Chapter 6.2.2. --- Naturalness Tests --- p.90 === Chapter 6.2.2.1 --- Word-level --- p.90 === Chapter 6.2.2.1.1 --- Method --- p.90 === Chapter 6.2.2.1.2 --- Results --- p.91 === Chapter 6.2.3.1.3 --- Analysis --- p.91 === Chapter 6.2.2.2 --- Sentence-level --- p.92 === Chapter 6.2.2.2.1 --- Method --- p.92 === Chapter 6.2.2.2.2 --- Results --- p.93 === Chapter 6.2.2.2.3 --- Analysis --- p.94 === Chapter 6.3. --- Conclusions --- p.95 === Chapter 6.4. --- Summary --- p.95 === Reference --- p.96 === Chapter Chapter 7 --- Conclusions and Future Work --- p.97 === Chapter 7.1. --- Conclusions --- p.97 === Chapter 7.2. --- Suggested Future Work --- p.99 === Appendix --- p.100 === Appendix 1 Linear Regression --- p.100 === Appendix 2 36 Templates of Cross-word Contours --- p.101 === Appendix 3 Word List for Word-level Tests --- p.102 === Appendix 4 Syllable Occurrence in Word List of Intelligibility Test --- p.108 === Appendix 5 Wrongly Identified Word List --- p.112 === Appendix 6 Confusion Matrix --- p.115 === Appendix 7 Unintelligible Word List --- p.117 === Appendix 8 Noisy Word List --- p.119 === Appendix 9 Sentence List for Naturalness Test --- p.120
author2 Li, Yu Jia.
author_facet Li, Yu Jia.
title Prosody analysis and modeling for Cantonese text-to-speech.
title_short Prosody analysis and modeling for Cantonese text-to-speech.
title_full Prosody analysis and modeling for Cantonese text-to-speech.
title_fullStr Prosody analysis and modeling for Cantonese text-to-speech.
title_full_unstemmed Prosody analysis and modeling for Cantonese text-to-speech.
title_sort prosody analysis and modeling for cantonese text-to-speech.
publishDate 2003
url http://library.cuhk.edu.hk/record=b5891678
http://repository.lib.cuhk.edu.hk/en/item/cuhk-324229
_version_ 1718983106898165760
spelling ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_3242292019-02-26T03:36:16Z Prosody analysis and modeling for Cantonese text-to-speech. Speech processing systems Prosodic analysis (Linguistics) Speech synthesis Cantonese dialects--Data processing Li Yu Jia. Thesis (M.Phil.)--Chinese University of Hong Kong, 2003. Includes bibliographical references. Abstracts in English and Chinese. Chapter Chapter 1 --- Introduction --- p.1 Chapter 1.1. --- TTS Technology --- p.1 Chapter 1.2. --- Prosody --- p.2 Chapter 1.2.1. --- What is Prosody --- p.2 Chapter 1.2.2. --- Prosody from Different Perspectives --- p.3 Chapter 1.2.3. --- Acoustical Parameters of Prosody --- p.3 Chapter 1.2.4. --- Prosody in TTS --- p.5 Chapter 1.2.4.1 --- Analysis --- p.5 Chapter 1.2.4.2 --- Modeling --- p.6 Chapter 1.2.4.3 --- Evaluation --- p.6 Chapter 1.3. --- Thesis Objectives --- p.7 Chapter 1.4. --- Thesis Outline --- p.7 Reference --- p.8 Chapter Chapter 2 --- Cantonese --- p.9 Chapter 2.1. --- The Cantonese Dialect --- p.9 Chapter 2.1.1. --- Phonology --- p.10 Chapter 2.1.1.1 --- Initial --- p.11 Chapter 2.1.1.2 --- Final --- p.12 Chapter 2.1.1.3 --- Tone --- p.13 Chapter 2.1.2. --- Phonological Constraints --- p.14 Chapter 2.2. --- Tones in Cantonese --- p.15 Chapter 2.2.1. --- Tone System --- p.15 Chapter 2.2.2. --- Linguistic Significance --- p.18 Chapter 2.2.3. --- Acoustical Realization --- p.18 Chapter 2.3. --- Prosodic Variation in Continuous Cantonese Speech --- p.20 Chapter 2.4. --- Cantonese Speech Corpus - CUProsody --- p.21 Reference --- p.23 Chapter Chapter 3 --- F0 Normalization --- p.25 Chapter 3.1. --- F0 in Speech Production --- p.25 Chapter 3.2. --- F0 Extraction --- p.27 Chapter 3.3. --- Duration-normalized Tone Contour --- p.29 Chapter 3.4. --- F0 Normalization --- p.30 Chapter 3.4.1. --- Necessity and Motivation --- p.30 Chapter 3.4.2. --- F0 Normalization --- p.33 Chapter 3.4.2.1 --- Methodology --- p.33 Chapter 3.4.2.2 --- Assumptions --- p.34 Chapter 3.4.2.3 --- Estimation of Relative Tone Ratios --- p.35 Chapter 3.4.2.4 --- Derivation of Phrase Curve --- p.37 Chapter 3.4.2.5 --- Normalization of Absolute FO Values --- p.39 Chapter 3.4.3. --- Experiments and Discussion --- p.39 Chapter 3.5. --- Conclusions --- p.44 Reference --- p.45 Chapter Chapter 4 --- Acoustical FO Analysis --- p.48 Chapter 4.1. --- Methodology of FO Analysis --- p.48 Chapter 4.1.1. --- Analysis-by-Synthesis --- p.48 Chapter 4.1.2. --- Acoustical Analysis --- p.51 Chapter 4.2. --- Acoustical FO Analysis for Cantonese --- p.52 Chapter 4.2.1. --- Analysis of Phrase Curves --- p.52 Chapter 4.2.2. --- Analysis of Tone Contours --- p.55 Chapter 4.2.2.1 --- Context-independent Single-tone Contours --- p.56 Chapter 4.2.2.2 --- Contextual Variation --- p.58 Chapter 4.2.2.3 --- Co-articulated Tone Contours of Disyllabic Word --- p.59 Chapter 4.2.2.4 --- Cross-word Contours --- p.62 Chapter 4.2.2.5 --- Phrase-initial Tone Contours --- p.65 Chapter 4.3. --- Summary --- p.66 Reference --- p.67 Chapter Chapter5 --- Prosody Modeling for Cantonese Text-to-Speech --- p.70 Chapter 5.1. --- Parametric Model and Non-parametric Model --- p.70 Chapter 5.2. --- Cantonese Text-to-Speech: Baseline System --- p.72 Chapter 5.2.1. --- Sub-syllable Unit --- p.72 Chapter 5.2.2. --- Text Analysis Module --- p.73 Chapter 5.2.3. --- Acoustical Synthesis --- p.74 Chapter 5.2.4. --- Prosody Module --- p.74 Chapter 5.3. --- Enhanced Prosody Model --- p.74 Chapter 5.3.1. --- Modeling Tone Contours --- p.75 Chapter 5.3.1.1 --- Word-level FO Contours --- p.76 Chapter 5.3.1.2 --- Phrase-initial Tone Contours --- p.77 Chapter 5.3.1.3 --- Tone Contours at Word Boundary --- p.78 Chapter 5.3.2. --- Modeling Phrase Curves --- p.79 Chapter 5.3.3. --- Generation of Continuous FO Contours --- p.81 Chapter 5.4. --- Summary --- p.81 Reference --- p.82 Chapter Chapter 6 --- Performance Evaluation --- p.83 Chapter 6.1. --- Introduction to Perceptual Test --- p.83 Chapter 6.1.1. --- Aspects of Evaluation --- p.84 Chapter 6.1.2. --- Methods of Judgment Test --- p.84 Chapter 6.1.3. --- Problems in Perceptual Test --- p.85 Chapter 6.2. --- Perceptual Tests for Cantonese TTS --- p.86 Chapter 6.2.1. --- Intelligibility Tests --- p.86 Chapter 6.2.1.1 --- Method --- p.86 Chapter 6.2.1.2 --- Results --- p.88 Chapter 6.2.1.3 --- Analysis --- p.89 Chapter 6.2.2. --- Naturalness Tests --- p.90 Chapter 6.2.2.1 --- Word-level --- p.90 Chapter 6.2.2.1.1 --- Method --- p.90 Chapter 6.2.2.1.2 --- Results --- p.91 Chapter 6.2.3.1.3 --- Analysis --- p.91 Chapter 6.2.2.2 --- Sentence-level --- p.92 Chapter 6.2.2.2.1 --- Method --- p.92 Chapter 6.2.2.2.2 --- Results --- p.93 Chapter 6.2.2.2.3 --- Analysis --- p.94 Chapter 6.3. --- Conclusions --- p.95 Chapter 6.4. --- Summary --- p.95 Reference --- p.96 Chapter Chapter 7 --- Conclusions and Future Work --- p.97 Chapter 7.1. --- Conclusions --- p.97 Chapter 7.2. --- Suggested Future Work --- p.99 Appendix --- p.100 Appendix 1 Linear Regression --- p.100 Appendix 2 36 Templates of Cross-word Contours --- p.101 Appendix 3 Word List for Word-level Tests --- p.102 Appendix 4 Syllable Occurrence in Word List of Intelligibility Test --- p.108 Appendix 5 Wrongly Identified Word List --- p.112 Appendix 6 Confusion Matrix --- p.115 Appendix 7 Unintelligible Word List --- p.117 Appendix 8 Noisy Word List --- p.119 Appendix 9 Sentence List for Naturalness Test --- p.120 Li, Yu Jia. Chinese University of Hong Kong Graduate School. Division of Electronic Engineering. 2003 Text bibliography print xi, 125 leaves : ill. ; 30 cm. cuhk:324229 http://library.cuhk.edu.hk/record=b5891678 eng chi Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) http://repository.lib.cuhk.edu.hk/en/islandora/object/cuhk%3A324229/datastream/TN/view/Prosody%20analysis%20and%20modeling%20for%20Cantonese%20text-to-speech.jpghttp://repository.lib.cuhk.edu.hk/en/item/cuhk-324229