A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
Master's === Southern Taiwan University of Science and Technology (南台科技大學) === Department of Computer Science and Information Engineering (資訊工程系) === 98 === Speech communication is the most common service in Internet telecommunication and multimedia processing. However, because a speech signal must be transmitted continuously, an Internet voice service has to collect enough speech data before sending, which can cause larg...
Main Authors: | Jia-Yu Wang (王嘉宇) |
---|---|
Other Authors: | Rong-San Lin (林榮三) |
Format: | Others |
Language: | zh-TW |
Published: | 2010 |
Online Access: | http://ndltd.ncl.edu.tw/handle/72260792424405854470 |
id |
ndltd-TW-098STUT8392005 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098STUT8392005 2016-11-22T04:13:28Z http://ndltd.ncl.edu.tw/handle/72260792424405854470 A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs ITU-T G.723.1與G.729語音編碼器快速演算法之研究 Jia-Yu Wang 王嘉宇 Master's Southern Taiwan University of Science and Technology (南台科技大學) Department of Computer Science and Information Engineering (資訊工程系) 98 Rong-San Lin 林榮三 2010 學位論文 ; thesis 88 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === Southern Taiwan University of Science and Technology (南台科技大學) === Department of Computer Science and Information Engineering (資訊工程系) === 98 === Speech communication is the most common service in Internet telecommunication and multimedia processing. However, because a speech signal must be transmitted continuously, an Internet voice service has to collect enough speech data before sending, which can cause a large speech delay and degrade speech quality when network bandwidth is limited. To achieve "continuity", speech codecs with a high compression rate are used to generate a low-rate data stream, but such codecs have higher computational complexity. Reducing the bit rate while improving the speech quality of the codec is therefore the most important concern. ITU-T offers the G.723.1 and G.729 codecs, which are widely used in Internet applications. These codecs provide high speech quality at a low bit rate.
This thesis predicts the search range of the adaptive-codebook gain in the G.723.1 standard codec by minimizing the mean square error between the three-tap excitation signal, together with its residual signal, and a one-tap pitch predictor. For the G.723.1 MP-MLQ, we propose a fast search algorithm that uses a designed energy function and the multi-track position structure of the stochastic excitation signals to predict the candidate pulses for each subframe. For both the G.723.1 and the G.729 ACELP codebooks, we propose a fast search algorithm based on depth-first tree search (DFS) and pulse-position likelihood estimation. Because both encoders belong to the CELP coding structure, transcoding is completed through two processes, line spectral pair conversion and pitch conversion, both of which use linear interpolation. To reduce computational complexity further, we use two fast search algorithms in the transcoder. First, we employ residual signals to predict candidate gain vectors of the adaptive codebook in G.723.1. Next, we adopt a fast stochastic excitation pulse search method. Simulation results show that the proposed methods save a large amount of computation, while the reconstructed speech signal still maintains a certain level of speech quality with perceptually negligible degradation.
|
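The abstract states that line spectral pair (LSP) conversion between the two codecs relies on linear interpolation. The following is a minimal sketch of that idea only, not the thesis's actual procedure: it assumes one LSP vector per 30 ms G.723.1 frame is being mapped onto three 10 ms G.729 frames, and the function name `interpolate_lsp`, the parameter `n_out`, and the weighting scheme are hypothetical choices made for illustration.

```python
from typing import List, Sequence


def interpolate_lsp(prev_lsp: Sequence[float],
                    curr_lsp: Sequence[float],
                    n_out: int = 3) -> List[List[float]]:
    """Linearly interpolate between two LSP vectors (illustrative only).

    Returns n_out vectors. The k-th output (1-based) puts weight k / n_out
    on curr_lsp and the rest on prev_lsp, so the last output equals
    curr_lsp. The weights are an assumption, not taken from the thesis.
    """
    if len(prev_lsp) != len(curr_lsp):
        raise ValueError("LSP vectors must have the same order")
    if n_out < 1:
        raise ValueError("n_out must be at least 1")

    outputs = []
    for k in range(1, n_out + 1):
        w = k / n_out
        # Convex combination: ascending inputs give ascending outputs.
        outputs.append([(1.0 - w) * p + w * c
                        for p, c in zip(prev_lsp, curr_lsp)])
    return outputs


if __name__ == "__main__":
    # Toy 2nd-order vectors; real G.723.1 / G.729 LSP vectors have order 10.
    for vec in interpolate_lsp([0.0, 1.0], [3.0, 4.0], n_out=3):
        print(vec)
```

Because each output is a convex combination of two ascending vectors, the interpolated LSPs remain ascending, which is the ordering property that LSP-based synthesis filters depend on. The same weighting could be applied to pitch lags, although a real transcoder would also have to handle pitch doubling and halving, which this sketch ignores.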
author2 |
Rong-San Lin |
author_facet |
Rong-San Lin Jia-Yu Wang 王嘉宇 |
author |
Jia-Yu Wang 王嘉宇 |
spellingShingle |
Jia-Yu Wang 王嘉宇 A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
author_sort |
Jia-Yu Wang |
title |
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
title_short |
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
title_full |
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
title_fullStr |
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
title_full_unstemmed |
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs |
title_sort |
study on fast coding algorithm for itu-t g.723.1 and g.729 speech codecs |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/72260792424405854470 |
work_keys_str_mv |
AT jiayuwang astudyonfastcodingalgorithmforitutg7231andg729speechcodecs AT wángjiāyǔ astudyonfastcodingalgorithmforitutg7231andg729speechcodecs AT jiayuwang itutg7231yǔg729yǔyīnbiānmǎqìkuàisùyǎnsuànfǎzhīyánjiū AT wángjiāyǔ itutg7231yǔg729yǔyīnbiānmǎqìkuàisùyǎnsuànfǎzhīyánjiū AT jiayuwang studyonfastcodingalgorithmforitutg7231andg729speechcodecs AT wángjiāyǔ studyonfastcodingalgorithmforitutg7231andg729speechcodecs |
_version_ |
1718397260236062720 |