Synthesis by Concatenation of Waveform Segment

Master's thesis === Tamkang University === Graduate Institute of Computer Science and Information Engineering === 82 === In this thesis, we synthesize Chinese words by means of waveform splicing based on phonemes and pitch periods. To synthesize speech naturally and fluently by concatenating phonemes or pitch periods, there...

Bibliographic Details
Main Authors: Shiou-Ming Chu, 朱修明
Other Authors: Ching-Tang Hsieh
Format: Others
Language: zh-TW
Published: 1994
Online Access: http://ndltd.ncl.edu.tw/handle/61073183594518265938
id ndltd-TW-082TKU00392005
record_format oai_dc
spelling ndltd-TW-082TKU00392005 2016-02-08T04:06:32Z http://ndltd.ncl.edu.tw/handle/61073183594518265938 Synthesis by Concatenation of Waveform Segment 波形編集語音合成之研究(II) Shiou-Ming Chu 朱修明 Master's thesis, Tamkang University, Graduate Institute of Computer Science and Information Engineering, 82 Ching-Tang Hsieh 謝景棠 1994 學位論文 ; thesis 77 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's thesis === Tamkang University === Graduate Institute of Computer Science and Information Engineering === 82 === In this thesis, we synthesize Chinese words by means of waveform splicing based on phonemes and pitch periods. To synthesize speech naturally and fluently by concatenating phonemes or pitch periods, two issues matter: 1. how to capture the fundamental waveform, and 2. how to handle the situations that arise when splicing. We propose processing steps for both, and the resulting improvement in the quality of the synthesized speech was confirmed by experiments.
(1) How to capture the fundamental waveform: The quality of synthesized speech depends heavily on how speech waveforms are selected when building the speech database. We capture two types of speech signal, consonants and vowels.
1. When collecting a consonant, we retain the pure consonant part together with some adjoining vowel periods. This makes CV-type concatenation easy and fluent.
2. When collecting a vowel, we use the FFT cepstrum to estimate the pitch of the speech segment. We find the maximum amplitude (MA) within the pitch period, take the nearest zero-crossing point before the MA as the starting point of the pitch period, and take the point just before the next period's starting point as the ending point. Repeating the capture process, we retain nine pitch periods in the range of ...
3. A high sampling rate is used to build the speech database. The higher resolution of the speech waveform reduces the distortion introduced by the interpolation process, so speech synthesized from a database with a higher sampling rate is closer to the original speech than speech synthesized from one with a lower rate.
(2) How to handle the situations when splicing: The concatenation problem arises in these situations:
1. Concatenation of consonant and vowel (CV type):
2. Concatenation of pitch periods in a vowel (V type):
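
The vowel-capture steps in the description above (cepstral pitch estimation, locating the maximum amplitude, and cutting at the preceding zero crossing) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the thesis's implementation: the function names, the 60-400 Hz search range, the 1024-sample analysis frame, the reading of "zero crossing" as a negative-to-non-negative sign change, and the synthetic test vowel are all hypothetical choices made for this example.

```python
import numpy as np

def estimate_pitch_period(frame, fs, f_min=60.0, f_max=400.0):
    """Estimate the pitch period (in samples) from the real FFT cepstrum.
    The f_min/f_max search range is an assumption, not from the source."""
    windowed = frame * np.hamming(len(frame))
    log_mag = np.log(np.abs(np.fft.rfft(windowed)) + 1e-12)
    cepstrum = np.fft.irfft(log_mag)
    q_lo, q_hi = int(fs / f_max), int(fs / f_min)
    # The strongest cepstral peak in the allowed quefrency range
    return q_lo + int(np.argmax(cepstrum[q_lo:q_hi]))

def nearest_zero_crossing_before(signal, index):
    """Walk back from `index` to the nearest negative-to-non-negative
    sign change; return None if there is none."""
    i = index
    while i > 0:
        if signal[i - 1] < 0.0 <= signal[i]:
            return i
        i -= 1
    return None

def capture_pitch_periods(signal, period, first_search=0, count=9):
    """Cut `count` consecutive pitch periods. Each one starts at the zero
    crossing before its maximum amplitude (MA) and ends one sample before
    the next period's starting point."""
    starts = []
    pos = first_search
    while len(starts) < count + 1 and pos + period <= len(signal):
        ma = pos + int(np.argmax(signal[pos:pos + period]))
        start = nearest_zero_crossing_before(signal, ma)
        if start is not None and (not starts or start > starts[-1]):
            starts.append(start)
        pos = ma + period // 2  # centre the next search on the expected MA
    return [signal[a:b] for a, b in zip(starts[:-1], starts[1:])]

# Hypothetical test vowel: a damped 800 Hz sinusoid repeated every 80 samples
# (a 200 Hz pitch at a 16 kHz sampling rate).
fs = 16000
j = np.arange(80)
pulse = np.exp(-j / 20.0) * np.sin(2 * np.pi * 800 * j / fs + 0.3)
signal = np.tile(pulse, 20)

period = estimate_pitch_period(signal[:1024], fs)
periods = capture_pitch_periods(signal, period, first_search=100, count=9)
```

Centring each search window half a period after the previous MA keeps the search from picking the same peak twice, even when the cepstral estimate is off by a sample.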
author2 Ching-Tang Hsieh
author_facet Ching-Tang Hsieh
Shiou-Ming Chu
朱修明
author Shiou-Ming Chu
朱修明
title Synthesis by Concatenation of Waveform Segment
title_sort synthesis by concatenation of waveform segment
publishDate 1994
url http://ndltd.ncl.edu.tw/handle/61073183594518265938