Statistical Parametric Speech Synthesis using Deep Learning Architectures

Statistical Parametric Speech Synthesis using Deep Learning Architectures

本文研究了使用深度學習(Deep Learning)技術與模型的統計參數化語音合成(Statistical Parametric Speech Synthesis)框架。當前語音合成面臨的兩個主要的挑戰在於：採用聲學實現表達語音韻律的複雜度；訓練數據的稀疏性。這兩個問題很大地影響了合成語音的自然度。本文嘗試採用深度學習結構的建模能力，提高合成語音的語音自然度。 === 為了更精確地表示韻律上下文，本文定義了層次韻律結構，用以組織音段與超音段特征。本文採用深度學習結構，運用層次化結構的音節級別表示，構建語音合成系統。 === 受深度置信網絡(Deep Belief Network, DBN)在手...

Full description

Bibliographic Details
Other Authors:	Kang, Shiyin (author.)
Format:	Others
Language:	English Chinese
Published:	2016
Subjects:
Online Access:	http://repository.lib.cuhk.edu.hk/en/item/cuhk-1292251

Similar Items

Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis
by: Marvin Coto-Jiménez
Published: (2021-02-01)

Visual speech synthesis using dynamic visemes and deep learning architectures
by: Thangthai, Ausdang
Published: (2018)

Malay statistical parametric speech synthesis with intelligibility improvement using artificial intelligence
by: Lau, Chee Yong
Published: (2015)

A Review of Serbian Parametric Speech Synthesis Based on Deep Neural Networks
by: T. Delić, et al.
Published: (2017-06-01)

An Experimental Analysis of Deep Learning Architectures for Supervised Speech Enhancement
by: Soha A. Nossier, et al.
Published: (2021-12-01)

Adaptive Refinements of Pitch Tracking and HNR Estimation within a Vocoder for Statistical Parametric Speech Synthesis
by: Mohammed Salah Al-Radhi, et al.
Published: (2019-06-01)

A parametric monophone speech synthesis system
by: Klompje, Gideon
Published: (2008)

An improvement of speech synthesis by using prosodic information and deep learning models
by: Chiang, Jen-Chieh, et al.
Published: (2019)

A Review of Deep Learning Based Speech Synthesis
by: Yishuang Ning, et al.
Published: (2019-09-01)

Deep Learning for Mandarin-Tibetan Cross-Lingual Speech Synthesis
by: Weizhao Zhang, et al.
Published: (2019-01-01)

Bidirectional deep architecture for Arabic speech recognition
by: Zerari Naima, et al.
Published: (2019-04-01)

Algorithms and VLSI architectures for parametric additive synthesis
by: Spanier, Jonathan Robert
Published: (1999)

Expression of basic emotions in Estonian parametric text-to-speech synthesis
by: Kairi Tamuri, et al.
Published: (2015-12-01)

Development and Evaluation of Speech Synthesis System Based on Deep Learning Models
by: Alakbar Valizada, et al.
Published: (2021-05-01)

Efficient Feature-Aware Hybrid Model of Deep Learning Architectures for Speech Emotion Recognition
by: Mai Ezz-Eldin, et al.
Published: (2021-01-01)

Statistical analysis, modelling and synthesis of voice for text to speech synthesis
by: Low, Phuay Hui
Published: (2004)

Statistical parametric evaluation on new corpus design for Malay speech articulation disorder early diagnosis
by: Mazenan, Mohd. Nizam, et al.
Published: (2015)

Statistical Parametric Models and Inference for Biomedical Signal Processing: Applications in Speech and Magnetic Resonance Imaging
by: Hong, Jung
Published: (2013)

MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language
by: Kostadin Mishev, et al.
Published: (2020-10-01)

Speech recognition based on spectrograms by using deep learning
by: Leon, Roy Eduardo Aguilar
Published: (2018)

Deep unsupervised learning from speech
by: Drexler, Jennifer Fox
Published: (2016)

Deep learning-based methods for parametric shape prediction
by: Smirnov, Dmitriy,S.M.Massachusetts Institute of Technology.
Published: (2019)

Statistical learning in network architecture
by: Beverly, Robert E., 1975-
Published: (2009)

Parametric Speech Emotion Recognition Using Neural Network
by: Ma, Rui
Published: (2014)

Deep Learning for Speech Enhancement : A Study on WaveNet, GANs and General CNN-RNN Architectures
by: Xing Luo, Oscar
Published: (2019)

Lithuanian Speech Recognition Using Purely Phonetic Deep Learning
by: Laurynas Pipiras, et al.
Published: (2019-10-01)

Arabic speech recognition using end‐to‐end deep learning
by: Hamzah A. Alsayadi, et al.
Published: (2021-10-01)

Speech Enhancement Using Deep Learning Methods: A Review
by: Asri Rizki Yuliani, et al.
Published: (2021-08-01)

Crosslingual Acoustic Modeling in Speech Recognition Using Deep Learning
by: Hsiang-Hung Lu, et al.
Published: (2016)

Parametric variation in architecture.
Published: (2010)

T test as a parametric statistic
by: Tae Kyun Kim
Published: (2015-12-01)

Feedforward deep architectures for classification and synthesis
by: Warde-Farley, David
Published: (2018)

Modelling the fisheries of lake manzala, egypt, using parametric and non-parametric statistical methods
by: Abdelaal, Medhat Mohamed Ahmed
Published: (1999)

Deep Factorized and Variational Learning for Speech Recognition
by: Shen, Chen, et al.
Published: (2016)

Time interval statistics in speech synthesis: A critical evaluation
by: Underwood, M. J.
Published: (1968)

Enhancing Speech Recognition by Deep UnsupervisedLearning
by: Yen-Ju Lu, et al.
Published: (2017)

Deep Learning Applied to Speech Enhancement Algorithm
by: Yu-HsuanHuang, et al.
Published: (2018)

Deep Learning Architecture and Applications
Published: (2023)

A Smart Binaural Hearing Aid Architecture Leveraging a Smartphone APP With Deep-Learning Speech Enhancement
by: Yingdan Li, et al.
Published: (2020-01-01)

Deep learning: a statistical viewpoint
by: Bartlett, Peter L, et al.
Published: (2021)