The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems

博士 === 國立成功大學 === 資訊工程學系 === 85 === In this disserpation, the efficient and flexible VLSI architecture andimplementation for the voice word-recognizer and CELP voice compression systemare presented. In order to achieve a flexible and efficient VLSI realiz...

Full description

Bibliographic Details
Main Authors: Suen, An-Nan, 孫安南
Other Authors: Jhing-Fa Wang
Format: Others
Language:zh-TW
Published: 1997
Online Access:http://ndltd.ncl.edu.tw/handle/94585011867419107779
id ndltd-TW-085NCKU0392014
record_format oai_dc
spelling ndltd-TW-085NCKU03920142015-10-13T12:18:05Z http://ndltd.ncl.edu.tw/handle/94585011867419107779 The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems 語音辨識及語音壓縮系統之超大型積體電路架構與晶片設計 Suen, An-Nan 孫安南 博士 國立成功大學 資訊工程學系 85 In this disserpation, the efficient and flexible VLSI architecture andimplementation for the voice word-recognizer and CELP voice compression systemare presented. In order to achieve a flexible and efficient VLSI realization,we use a programmable with specific core designstrategy which incorporates the best aspects of both programmable andapplication specific signal processors to achieve high speed, high accuracy,and efficient hardware realization for the speech applications.In this study, we present the CELP processor architectureand VLSI implementation. A programmable application-specific singlechip design for the CELP algorithm drastically reduces the cost andachieves real-time performance.In particular, the issues and accuracy study forthe fixed-point realization are also addressed herein.In order to speed up the codebook search, we also present a new classifieddynamic partial search structure for stochastic codebook of FS1016 CELP coderto replace the fixed partial search for selecting the best excitation vectorof stochastic codebook. In the proposed scheme, the conventional one-stagestochastic codebook search is substituted with a two-stage dynamic methodfor reducing the computational complexity without degrading the voice quality.We also propose the VLSI architecture and chip implementation for theFS1016 CELP decoder with reduced power and memory requirements. A singlechip implementation of the CELP decoder drastically reduces the cost andthe size of many CELP vocoder systems. Finally,a new LPC vocoder model based on forward-backward waveform prediction (FBLPC)for very low bit rate speech coding is presented. The FBLPC model only encodesand tranmits partial representative waveforms based on the mixed excitationLPC model. On the other hand, other partial waveforms which are not to beencoded and transmitted are reconstructed by an interpolating method calledforward-backward waveform prediction in the synthesis. This interpolatingmethod can decrease the bit rates rapidly, and the vocoder also can produceintelligible speech. A 1200bps FBLPC vocoder based on this model has beendesigned and implemented in real time. Jhing-Fa Wang 王 駿 發 1997 學位論文 ; thesis 140 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 博士 === 國立成功大學 === 資訊工程學系 === 85 === In this disserpation, the efficient and flexible VLSI architecture andimplementation for the voice word-recognizer and CELP voice compression systemare presented. In order to achieve a flexible and efficient VLSI realization,we use a programmable with specific core designstrategy which incorporates the best aspects of both programmable andapplication specific signal processors to achieve high speed, high accuracy,and efficient hardware realization for the speech applications.In this study, we present the CELP processor architectureand VLSI implementation. A programmable application-specific singlechip design for the CELP algorithm drastically reduces the cost andachieves real-time performance.In particular, the issues and accuracy study forthe fixed-point realization are also addressed herein.In order to speed up the codebook search, we also present a new classifieddynamic partial search structure for stochastic codebook of FS1016 CELP coderto replace the fixed partial search for selecting the best excitation vectorof stochastic codebook. In the proposed scheme, the conventional one-stagestochastic codebook search is substituted with a two-stage dynamic methodfor reducing the computational complexity without degrading the voice quality.We also propose the VLSI architecture and chip implementation for theFS1016 CELP decoder with reduced power and memory requirements. A singlechip implementation of the CELP decoder drastically reduces the cost andthe size of many CELP vocoder systems. Finally,a new LPC vocoder model based on forward-backward waveform prediction (FBLPC)for very low bit rate speech coding is presented. The FBLPC model only encodesand tranmits partial representative waveforms based on the mixed excitationLPC model. On the other hand, other partial waveforms which are not to beencoded and transmitted are reconstructed by an interpolating method calledforward-backward waveform prediction in the synthesis. This interpolatingmethod can decrease the bit rates rapidly, and the vocoder also can produceintelligible speech. A 1200bps FBLPC vocoder based on this model has beendesigned and implemented in real time.
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Suen, An-Nan
孫安南
author Suen, An-Nan
孫安南
spellingShingle Suen, An-Nan
孫安南
The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
author_sort Suen, An-Nan
title The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
title_short The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
title_full The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
title_fullStr The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
title_full_unstemmed The Architecture and Chip Implementation for Speech Recognition and Voice Compression Systems
title_sort architecture and chip implementation for speech recognition and voice compression systems
publishDate 1997
url http://ndltd.ncl.edu.tw/handle/94585011867419107779
work_keys_str_mv AT suenannan thearchitectureandchipimplementationforspeechrecognitionandvoicecompressionsystems
AT sūnānnán thearchitectureandchipimplementationforspeechrecognitionandvoicecompressionsystems
AT suenannan yǔyīnbiànshíjíyǔyīnyāsuōxìtǒngzhīchāodàxíngjītǐdiànlùjiàgòuyǔjīngpiànshèjì
AT sūnānnán yǔyīnbiànshíjíyǔyīnyāsuōxìtǒngzhīchāodàxíngjītǐdiànlùjiàgòuyǔjīngpiànshèjì
AT suenannan architectureandchipimplementationforspeechrecognitionandvoicecompressionsystems
AT sūnānnán architectureandchipimplementationforspeechrecognitionandvoicecompressionsystems
_version_ 1716857165163003904