Character-Level Linguistic Features Extraction for Text-to-Speech System
碩士 === 國立臺北科技大學 === 電子工程系研究所 === 104 === Good context dependent is a key part of the speech synthesis, the traditional context dependent depend on NLP (Natural Language Processing, NLP) parser text analysis. It is hence difficult to design one especially for speech synthesis. To alleviate these draw...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Online Access: | http://ndltd.ncl.edu.tw/handle/nh7kh2 |
id |
ndltd-TW-104TIT05427121 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104TIT054271212019-05-15T23:53:22Z http://ndltd.ncl.edu.tw/handle/nh7kh2 Character-Level Linguistic Features Extraction for Text-to-Speech System 基於字元階層之語言合成文脈訊息特徵參數擷取 Kuan-Hung Chen 陳冠宏 碩士 國立臺北科技大學 電子工程系研究所 104 Good context dependent is a key part of the speech synthesis, the traditional context dependent depend on NLP (Natural Language Processing, NLP) parser text analysis. It is hence difficult to design one especially for speech synthesis. To alleviate these drawbacks, establishing an end-to-end speech synthesis system, we propose to use character-level word2vector and recurrent neural networks (RNNs) to directly convert input character sequences into latent linguistic feature vectors for context dependent. In the end, we use a mixed English-Chinese speech synthesis system to test this idea. Experimental results show that proposed approach provides comparable performance with conventional NLP parser-based methods. Yuan-Fu Liao 廖元甫 學位論文 ; thesis 0 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺北科技大學 === 電子工程系研究所 === 104 === Good context dependent is a key part of the speech synthesis, the traditional context dependent depend on NLP (Natural Language Processing, NLP) parser text analysis. It is hence difficult to design one especially for speech synthesis. To alleviate these drawbacks, establishing an end-to-end speech synthesis system, we propose to use character-level word2vector and recurrent neural networks (RNNs) to directly convert input character sequences into latent linguistic feature vectors for context dependent. In the end, we use a mixed English-Chinese speech synthesis system to test this idea. Experimental results show that proposed approach provides comparable performance with conventional NLP parser-based methods.
|
author2 |
Yuan-Fu Liao |
author_facet |
Yuan-Fu Liao Kuan-Hung Chen 陳冠宏 |
author |
Kuan-Hung Chen 陳冠宏 |
spellingShingle |
Kuan-Hung Chen 陳冠宏 Character-Level Linguistic Features Extraction for Text-to-Speech System |
author_sort |
Kuan-Hung Chen |
title |
Character-Level Linguistic Features Extraction for Text-to-Speech System |
title_short |
Character-Level Linguistic Features Extraction for Text-to-Speech System |
title_full |
Character-Level Linguistic Features Extraction for Text-to-Speech System |
title_fullStr |
Character-Level Linguistic Features Extraction for Text-to-Speech System |
title_full_unstemmed |
Character-Level Linguistic Features Extraction for Text-to-Speech System |
title_sort |
character-level linguistic features extraction for text-to-speech system |
url |
http://ndltd.ncl.edu.tw/handle/nh7kh2 |
work_keys_str_mv |
AT kuanhungchen characterlevellinguisticfeaturesextractionfortexttospeechsystem AT chénguānhóng characterlevellinguisticfeaturesextractionfortexttospeechsystem AT kuanhungchen jīyúzìyuánjiēcéngzhīyǔyánhéchéngwénmàixùnxītèzhēngcānshùxiéqǔ AT chénguānhóng jīyúzìyuánjiēcéngzhīyǔyánhéchéngwénmàixùnxītèzhēngcānshùxiéqǔ |
_version_ |
1719155898014760960 |