Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization
<p/> <p>This paper proposes a scalable speech coding scheme using the embedded matrix quantization of the LSFs in the LPC model. For an efficient quantization of the spectral parameters, two types of codebooks of different sizes are designed and used to encode unvoiced and mixed voicing...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2010-01-01
|
Series: | EURASIP Journal on Advances in Signal Processing |
Online Access: | http://asp.eurasipjournals.com/content/2010/480345 |
id |
doaj-8e69287649b747cb9919db5c52f352cb |
---|---|
record_format |
Article |
spelling |
doaj-8e69287649b747cb9919db5c52f352cb2020-11-24T21:15:21ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802010-01-0120101480345Very Low Rate Scalable Speech Coding through Classified Embedded Matrix QuantizationGhaemmaghami ShahrokhJahangiri Ehsan<p/> <p>This paper proposes a scalable speech coding scheme using the embedded matrix quantization of the LSFs in the LPC model. For an efficient quantization of the spectral parameters, two types of codebooks of different sizes are designed and used to encode unvoiced and mixed voicing segments separately. The tree-like structured codebooks of our embedded quantizer, constructed through a cell merging process, help to make a fine-grain scalable speech coder. Using an efficient adaptive dual-band approximation of the LPC excitation, where voicing transition frequency is determined based on the concept of instantaneous frequency in the frequency domain, near natural sounding synthesized speech is achieved. Assessment results, including both overall quality and intelligibility scores show that the proposed coding scheme can be a reasonable choice for speech coding in low bandwidth communication applications.</p>http://asp.eurasipjournals.com/content/2010/480345 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ghaemmaghami Shahrokh Jahangiri Ehsan |
spellingShingle |
Ghaemmaghami Shahrokh Jahangiri Ehsan Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization EURASIP Journal on Advances in Signal Processing |
author_facet |
Ghaemmaghami Shahrokh Jahangiri Ehsan |
author_sort |
Ghaemmaghami Shahrokh |
title |
Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization |
title_short |
Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization |
title_full |
Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization |
title_fullStr |
Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization |
title_full_unstemmed |
Very Low Rate Scalable Speech Coding through Classified Embedded Matrix Quantization |
title_sort |
very low rate scalable speech coding through classified embedded matrix quantization |
publisher |
SpringerOpen |
series |
EURASIP Journal on Advances in Signal Processing |
issn |
1687-6172 1687-6180 |
publishDate |
2010-01-01 |
description |
<p/> <p>This paper proposes a scalable speech coding scheme using the embedded matrix quantization of the LSFs in the LPC model. For an efficient quantization of the spectral parameters, two types of codebooks of different sizes are designed and used to encode unvoiced and mixed voicing segments separately. The tree-like structured codebooks of our embedded quantizer, constructed through a cell merging process, help to make a fine-grain scalable speech coder. Using an efficient adaptive dual-band approximation of the LPC excitation, where voicing transition frequency is determined based on the concept of instantaneous frequency in the frequency domain, near natural sounding synthesized speech is achieved. Assessment results, including both overall quality and intelligibility scores show that the proposed coding scheme can be a reasonable choice for speech coding in low bandwidth communication applications.</p> |
url |
http://asp.eurasipjournals.com/content/2010/480345 |
work_keys_str_mv |
AT ghaemmaghamishahrokh verylowratescalablespeechcodingthroughclassifiedembeddedmatrixquantization AT jahangiriehsan verylowratescalablespeechcodingthroughclassifiedembeddedmatrixquantization |
_version_ |
1716745594037338112 |