Character-Level Linguistic Features Extraction for Text-to-Speech System

碩士 === 國立臺北科技大學 === 電子工程系研究所 === 104 === Good context dependent is a key part of the speech synthesis, the traditional context dependent depend on NLP (Natural Language Processing, NLP) parser text analysis. It is hence difficult to design one especially for speech synthesis. To alleviate these draw...

Full description

Bibliographic Details
Main Authors: Kuan-Hung Chen, 陳冠宏
Other Authors: Yuan-Fu Liao
Format: Others
Language:zh-TW
Online Access:http://ndltd.ncl.edu.tw/handle/nh7kh2
Description
Summary:碩士 === 國立臺北科技大學 === 電子工程系研究所 === 104 === Good context dependent is a key part of the speech synthesis, the traditional context dependent depend on NLP (Natural Language Processing, NLP) parser text analysis. It is hence difficult to design one especially for speech synthesis. To alleviate these drawbacks, establishing an end-to-end speech synthesis system, we propose to use character-level word2vector and recurrent neural networks (RNNs) to directly convert input character sequences into latent linguistic feature vectors for context dependent. In the end, we use a mixed English-Chinese speech synthesis system to test this idea. Experimental results show that proposed approach provides comparable performance with conventional NLP parser-based methods.