A Study on Wide Band Speech Generation Based on G729

碩士 === 國立臺灣科技大學 === 資訊工程系 === 93 === In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech productio...

Full description

Bibliographic Details
Main Authors: Wang-Long Lee, 李旺隆
Other Authors: Hung-Yun Gu
Format: Others
Language:zh-TW
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/92084836042876277384
id ndltd-TW-093NTUST392001
record_format oai_dc
spelling ndltd-TW-093NTUST3920012016-06-13T04:17:34Z http://ndltd.ncl.edu.tw/handle/92084836042876277384 A Study on Wide Band Speech Generation Based on G729 基於G729之寬頻語音產生之研究 Wang-Long Lee 李旺隆 碩士 國立臺灣科技大學 資訊工程系 93 In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech production model of linear prediction, in the sender side, we analyze the voicing strength, gain and envelope of highband part of speech for every 10ms frame. Then, these analyzed parameter values are quantized and packed into 16 bits. This data rate is one fifth of the original rate of ITU G729 codec. In the receiver side harmonic and white noise generators are used to generate excitation signals, respectively. These two are mixed according to the voicing strength parameter. Then, the mixed excitation signal, after gain adjusting, is filtered by the all pole model constructed in terms of linear prediction coefficients. The obtained highband signal part is added with the lowband signal part generated by G729 decoder to synthesize the desired wideband speech. To evaluate the quality of the synthesized wideband speech, we first port the source code of G729 codec into the program of winRTP which is used for Internet telephony. Then, we implement the needed program modules for processing the highband part of speech. After integration of the system, we conduct a subjective perception tests in which the quality of the speech of original G729 and our wideband speech are measured. The result of the tests show that the quality of our bandwidth augmented speech has apparent improvement over the original G729 speech. Hung-Yun Gu 古鴻炎 2004 學位論文 ; thesis 45 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 93 === In this thesis, we study the problem of improving the speech fidelity in Internet telephony under the restriction that only a small data rate overhead is tolerable. Our way is to add high frequency components into the speech signals. Based on the speech production model of linear prediction, in the sender side, we analyze the voicing strength, gain and envelope of highband part of speech for every 10ms frame. Then, these analyzed parameter values are quantized and packed into 16 bits. This data rate is one fifth of the original rate of ITU G729 codec. In the receiver side harmonic and white noise generators are used to generate excitation signals, respectively. These two are mixed according to the voicing strength parameter. Then, the mixed excitation signal, after gain adjusting, is filtered by the all pole model constructed in terms of linear prediction coefficients. The obtained highband signal part is added with the lowband signal part generated by G729 decoder to synthesize the desired wideband speech. To evaluate the quality of the synthesized wideband speech, we first port the source code of G729 codec into the program of winRTP which is used for Internet telephony. Then, we implement the needed program modules for processing the highband part of speech. After integration of the system, we conduct a subjective perception tests in which the quality of the speech of original G729 and our wideband speech are measured. The result of the tests show that the quality of our bandwidth augmented speech has apparent improvement over the original G729 speech.
author2 Hung-Yun Gu
author_facet Hung-Yun Gu
Wang-Long Lee
李旺隆
author Wang-Long Lee
李旺隆
spellingShingle Wang-Long Lee
李旺隆
A Study on Wide Band Speech Generation Based on G729
author_sort Wang-Long Lee
title A Study on Wide Band Speech Generation Based on G729
title_short A Study on Wide Band Speech Generation Based on G729
title_full A Study on Wide Band Speech Generation Based on G729
title_fullStr A Study on Wide Band Speech Generation Based on G729
title_full_unstemmed A Study on Wide Band Speech Generation Based on G729
title_sort study on wide band speech generation based on g729
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/92084836042876277384
work_keys_str_mv AT wanglonglee astudyonwidebandspeechgenerationbasedong729
AT lǐwànglóng astudyonwidebandspeechgenerationbasedong729
AT wanglonglee jīyúg729zhīkuānpínyǔyīnchǎnshēngzhīyánjiū
AT lǐwànglóng jīyúg729zhīkuānpínyǔyīnchǎnshēngzhīyánjiū
AT wanglonglee studyonwidebandspeechgenerationbasedong729
AT lǐwànglóng studyonwidebandspeechgenerationbasedong729
_version_ 1718304009084731392