A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs

碩士 === 南台科技大學 === 資訊工程系 === 98 === Speech communication is the most common service in the Internet telecommunication and multimedia process. However, since speech signal should be continuously sent back, the voice in the service of the Internet should collect enough speech data, which can cause larg...

Full description

Bibliographic Details
Main Authors: Jia-Yu Wang, 王嘉宇
Other Authors: Rong-San Lin
Format: Others
Language:zh-TW
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/72260792424405854470
id ndltd-TW-098STUT8392005
record_format oai_dc
spelling ndltd-TW-098STUT83920052016-11-22T04:13:28Z http://ndltd.ncl.edu.tw/handle/72260792424405854470 A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs ITU-T G.723.1與G.729語音編碼器快速演算法之研究 Jia-Yu Wang 王嘉宇 碩士 南台科技大學 資訊工程系 98 Speech communication is the most common service in the Internet telecommunication and multimedia process. However, since speech signal should be continuously sent back, the voice in the service of the Internet should collect enough speech data, which can cause large speech delay and can degrade the speech quality in a limited network bandwidth. To achieve "continuity", speech codec with high compression rate has been used to generate a low-rate data stream, but that codec requires higher computational complexity. Thus, reducing the bit rate and improving speech quality of codec is the most significant. ITU-T offers the G.723.1and G.729 codecs that have used popularly in the Internet applications. These codecs offer high quality and low bit rate coding constitution. This paper predict the search range of adaptive codebook-gain in the G.723.1 standard codec by minimizing the mean square error between the three-tap excitation signal with its residual signal and one-tap pitch predictor. For the G.723.1 MP-MLQ, we propose a fast search algorithm by using a designed energy function and the multi-track positions structure of the stochastic excitation signals to predict the candidate pulses for each subframe. As for both of the G.723.1 and the G.729 ACELP codebook, we base on depth-first tree search (DFS) and pulse-position likelihood-estimate to propose a fast search algorithm. As the two encoders belong to CELP coding structure, transcoding procedures are completed through two processes: line spectral pair and pitch conversions. They are all used to linear interpolation processing. For further computational complexity reduction, we use two fast search algorithms. First, we employ residual signals to predict candidate gain-vectors of adaptive-codebook in the G.723.1. Next, we adopt fast stochastic excitation pulses search method. Simulation results show that the proposed methods reduce a large amount of computation. Also, reconstructed speech signal still maintain a certain level of speech quality with perceptually negligible degradation. Rong-San Lin 林榮三 2010 學位論文 ; thesis 88 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 南台科技大學 === 資訊工程系 === 98 === Speech communication is the most common service in the Internet telecommunication and multimedia process. However, since speech signal should be continuously sent back, the voice in the service of the Internet should collect enough speech data, which can cause large speech delay and can degrade the speech quality in a limited network bandwidth. To achieve "continuity", speech codec with high compression rate has been used to generate a low-rate data stream, but that codec requires higher computational complexity. Thus, reducing the bit rate and improving speech quality of codec is the most significant. ITU-T offers the G.723.1and G.729 codecs that have used popularly in the Internet applications. These codecs offer high quality and low bit rate coding constitution. This paper predict the search range of adaptive codebook-gain in the G.723.1 standard codec by minimizing the mean square error between the three-tap excitation signal with its residual signal and one-tap pitch predictor. For the G.723.1 MP-MLQ, we propose a fast search algorithm by using a designed energy function and the multi-track positions structure of the stochastic excitation signals to predict the candidate pulses for each subframe. As for both of the G.723.1 and the G.729 ACELP codebook, we base on depth-first tree search (DFS) and pulse-position likelihood-estimate to propose a fast search algorithm. As the two encoders belong to CELP coding structure, transcoding procedures are completed through two processes: line spectral pair and pitch conversions. They are all used to linear interpolation processing. For further computational complexity reduction, we use two fast search algorithms. First, we employ residual signals to predict candidate gain-vectors of adaptive-codebook in the G.723.1. Next, we adopt fast stochastic excitation pulses search method. Simulation results show that the proposed methods reduce a large amount of computation. Also, reconstructed speech signal still maintain a certain level of speech quality with perceptually negligible degradation.
author2 Rong-San Lin
author_facet Rong-San Lin
Jia-Yu Wang
王嘉宇
author Jia-Yu Wang
王嘉宇
spellingShingle Jia-Yu Wang
王嘉宇
A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
author_sort Jia-Yu Wang
title A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
title_short A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
title_full A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
title_fullStr A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
title_full_unstemmed A Study on Fast Coding Algorithm for ITU-T G.723.1 and G.729 Speech Codecs
title_sort study on fast coding algorithm for itu-t g.723.1 and g.729 speech codecs
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/72260792424405854470
work_keys_str_mv AT jiayuwang astudyonfastcodingalgorithmforitutg7231andg729speechcodecs
AT wángjiāyǔ astudyonfastcodingalgorithmforitutg7231andg729speechcodecs
AT jiayuwang itutg7231yǔg729yǔyīnbiānmǎqìkuàisùyǎnsuànfǎzhīyánjiū
AT wángjiāyǔ itutg7231yǔg729yǔyīnbiānmǎqìkuàisùyǎnsuànfǎzhīyánjiū
AT jiayuwang studyonfastcodingalgorithmforitutg7231andg729speechcodecs
AT wángjiāyǔ studyonfastcodingalgorithmforitutg7231andg729speechcodecs
_version_ 1718397260236062720