Summary: | Master's === National Chiao Tung University === Institute of Communications Engineering === 103 === This research investigates the prediction of prosodic break tags for a Mandarin Chinese speech synthesizer. Only linguistic features obtained from a parser are available for predicting break tags. Previous studies mainly used word-level linguistic features (e.g., POS) and sentence-level features (e.g., the distance between a juncture and punctuation marks) to predict the break tag of each inter-syllable juncture. In this research, we additionally use compound words and two types of special phrases (de phrases and conjunction phrases) to improve break tag prediction. Experimental results on a speech corpus of 376 utterances at a normal speaking rate confirm that linguistic features augmented with compound-word and phrase information are effective for prosodic break tag prediction: the prediction accuracy over the 7 break tag types rises from 70.34% to 74.05%. The main improvements are in the pitch-reset break at prosodic word boundaries (B2-1), the short-pause break (B2-2), and the break at prosodic phrase boundaries (B3). These improvements give continuous TTS a more effective break rhythm.
|