Speech Spectrum Coding Based on KLT and Power Series

碩士 === 國立臺北科技大學 === 電機工程系研究所 === 97 === Most of low bit-rate speech coders based on linear predictive coding (LPC) model use line spectrum frequencies (LSF) to represent the short-time spectrum of speech. The Karhunen-Loeve transform power-series vector quantizer (KLT-PSQ) method is proposed in the...

Full description

Bibliographic Details
Main Authors: Hsing-Hsu Chen, 陳星旭
Other Authors: 簡福榮
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/9aa5hb
Description
Summary:碩士 === 國立臺北科技大學 === 電機工程系研究所 === 97 === Most of low bit-rate speech coders based on linear predictive coding (LPC) model use line spectrum frequencies (LSF) to represent the short-time spectrum of speech. The Karhunen-Loeve transform power-series vector quantizer (KLT-PSQ) method is proposed in the thesis to efficiently decrease the bit rates. The multi-stage vector quantization (MSVQ) that is adopted by mixed excitation linear prediction (MELP) coder needs 24-25 bits/frame to reach an spectral distortion of 1 dB. KLT-PSQ1 and KLT-PSQ2 can maintain the same spectral transparency at only 20-21 bits/frame. The scheme that includes both KLT-PSQ1 and a fixed memoryless VQ controlled by a switch is denoted as KLT-PSQ3. KLT-PSQ3 is shown to be able to significantly decrease error propagation. Over the memoryless binary symmetric channel (BSC) with error rates from 0 % to 0.5 %, the experimental result shows that the performance of KLT-PSQ3 is more robust than that of MSVQ. However, the BSC with error rate from 1 % to 10 %, the advantage is obviously canceled out by error propagation.