A BLSTM and WaveNet-Based Voice Conversion Method With Waveform Collapse Suppression by Post-Processing
In recent years, neural network-based voice conversion methods have been rapidly developed, and many different models and neural networks have been applied in parallel voice conversion. However, the over-smoothing of parametric methods [e.g., bidirectional long short-term memory (BLSTM)] and the wav...
Main Authors: | Xiaokong Miao, Xiongwei Zhang, Meng Sun, Changyan Zheng, Tieyong Cao |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8695725/ |
Similar Items
-
Non-Parallel Voice Conversion System With WaveNet Vocoder and Collapsed Speech Suppression
by: Yi-Chiao Wu, et al.
Published: (2020-01-01) -
Voice Conversion With CycleRNN-Based Spectral Mapping and Finely Tuned WaveNet Vocoder
by: Patrick Lumban Tobing, et al.
Published: (2019-01-01) -
Noise-Robust Voice Conversion Using High-Quefrency Boosting via Sub-Band Cepstrum Conversion and Fusion
by: Xiaokong Miao, et al.
Published: (2019-12-01) -
WaveNet With Cross-Attention for Audiovisual Speech Recognition
by: Hui Wang, et al.
Published: (2020-01-01) -
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
by: Yi Zhao, et al.
Published: (2018-01-01)