A Study on Wide Band Speech Generation Based on G729
碩士 === 國立臺灣科技大學 === 資訊工程系 === 93 === In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech productio...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2004
|
Online Access: | http://ndltd.ncl.edu.tw/handle/92084836042876277384 |
id |
ndltd-TW-093NTUST392001 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093NTUST3920012016-06-13T04:17:34Z http://ndltd.ncl.edu.tw/handle/92084836042876277384 A Study on Wide Band Speech Generation Based on G729 基於G729之寬頻語音產生之研究 Wang-Long Lee 李旺隆 碩士 國立臺灣科技大學 資訊工程系 93 In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech production model of linear prediction, in the sender side, we analyze the voicing strength, gain and envelope of highband part of speech for every 10ms frame. Then, these analyzed parameter values are quantized and packed into 16 bits. This data rate is one fifth of the original rate of ITU G729 codec. In the receiver side harmonic and white noise generators are used to generate excitation signals, respectively. These two are mixed according to the voicing strength parameter. Then, the mixed excitation signal, after gain adjusting, is filtered by the all pole model constructed in terms of linear prediction coefficients. The obtained highband signal part is added with the lowband signal part generated by G729 decoder to synthesize the desired wideband speech. To evaluate the quality of the synthesized wideband speech, we first port the source code of G729 codec into the program of winRTP which is used for Internet telephony. Then, we implement the needed program modules for processing the highband part of speech. After integration of the system, we conduct a subjective perception tests in which the quality of the speech of original G729 and our wideband speech are measured. The result of the tests show that the quality of our bandwidth augmented speech has apparent improvement over the original G729 speech. Hung-Yun Gu 古鴻炎 2004 學位論文 ; thesis 45 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 資訊工程系 === 93 === In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech production model of linear prediction, in the sender side, we analyze the voicing strength, gain and envelope of highband part of speech for every 10ms frame. Then, these analyzed parameter values are quantized and packed into 16 bits. This data rate is one fifth of the original rate of ITU G729 codec. In the receiver side harmonic and white noise generators are used to generate excitation signals, respectively. These two are mixed according to the voicing strength parameter. Then, the mixed excitation signal, after gain adjusting, is filtered by the all pole model constructed in terms of linear prediction coefficients. The obtained highband signal part is added with the lowband signal part generated by G729 decoder to synthesize the desired wideband speech.
To evaluate the quality of the synthesized wideband speech, we first port the source code of G729 codec into the program of winRTP which is used for Internet telephony. Then, we implement the needed program modules for processing the highband part of speech. After integration of the system, we conduct a subjective perception tests in which the quality of the speech of original G729 and our wideband speech are measured. The result of the tests show that the quality of our bandwidth augmented speech has apparent improvement over the original G729 speech.
|
author2 |
Hung-Yun Gu |
author_facet |
Hung-Yun Gu Wang-Long Lee 李旺隆 |
author |
Wang-Long Lee 李旺隆 |
spellingShingle |
Wang-Long Lee 李旺隆 A Study on Wide Band Speech Generation Based on G729 |
author_sort |
Wang-Long Lee |
title |
A Study on Wide Band Speech Generation Based on G729 |
title_short |
A Study on Wide Band Speech Generation Based on G729 |
title_full |
A Study on Wide Band Speech Generation Based on G729 |
title_fullStr |
A Study on Wide Band Speech Generation Based on G729 |
title_full_unstemmed |
A Study on Wide Band Speech Generation Based on G729 |
title_sort |
study on wide band speech generation based on g729 |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/92084836042876277384 |
work_keys_str_mv |
AT wanglonglee astudyonwidebandspeechgenerationbasedong729 AT lǐwànglóng astudyonwidebandspeechgenerationbasedong729 AT wanglonglee jīyúg729zhīkuānpínyǔyīnchǎnshēngzhīyánjiū AT lǐwànglóng jīyúg729zhīkuānpínyǔyīnchǎnshēngzhīyánjiū AT wanglonglee studyonwidebandspeechgenerationbasedong729 AT lǐwànglóng studyonwidebandspeechgenerationbasedong729 |
_version_ |
1718304009084731392 |