Speech recognition on DSP: algorithm optimization and performance analysis.
Yuan Meng. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2004. === Includes bibliographical references (leaves 85-91). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- History of ASR development --- p.2 === Chapter 1.2 --- Fundamentals of au...
Other Authors: | |
---|---|
Format: | Others |
Language: | English Chinese |
Published: |
2004
|
Subjects: | |
Online Access: | http://library.cuhk.edu.hk/record=b5892165 http://repository.lib.cuhk.edu.hk/en/item/cuhk-324923 |
id |
ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_324923 |
---|---|
record_format |
oai_dc |
collection |
NDLTD |
language |
English Chinese |
format |
Others
|
sources |
NDLTD |
topic |
Automatic speech recognition Signal processing--Digital techniques |
spellingShingle |
Automatic speech recognition Signal processing--Digital techniques Speech recognition on DSP: algorithm optimization and performance analysis. |
description |
Yuan Meng. === Thesis (M.Phil.)--Chinese University of Hong Kong, 2004. === Includes bibliographical references (leaves 85-91). === Abstracts in English and Chinese. === Chapter 1 --- Introduction --- p.1 === Chapter 1.1 --- History of ASR development --- p.2 === Chapter 1.2 --- Fundamentals of automatic speech recognition --- p.3 === Chapter 1.2.1 --- Classification of ASR systems --- p.3 === Chapter 1.2.2 --- Automatic speech recognition process --- p.4 === Chapter 1.3 --- Performance measurements of ASR --- p.7 === Chapter 1.3.1 --- Recognition accuracy --- p.7 === Chapter 1.3.2 --- Complexity --- p.7 === Chapter 1.3.3 --- Robustness --- p.8 === Chapter 1.4 --- Motivation and goal of this work --- p.8 === Chapter 1.5 --- Thesis outline --- p.10 === Chapter 2 --- Signal processing techniques for front-end --- p.12 === Chapter 2.1 --- Basic feature extraction principles --- p.13 === Chapter 2.1.1 --- Pre-emphasis --- p.13 === Chapter 2.1.2 --- Frame blocking and windowing --- p.13 === Chapter 2.1.3 --- Discrete Fourier Transform (DFT) computation --- p.15 === Chapter 2.1.4 --- Spectral magnitudes --- p.15 === Chapter 2.1.5 --- Mel-frequency filterbank --- p.16 === Chapter 2.1.6 --- Logarithm of filter energies --- p.18 === Chapter 2.1.7 --- Discrete Cosine Transformation (DCT) --- p.18 === Chapter 2.1.8 --- Cepstral Weighting --- p.19 === Chapter 2.1.9 --- Dynamic featuring --- p.19 === Chapter 2.2 --- Practical issues --- p.20 === Chapter 2.2.1 --- Review of practical problems and solutions in ASR appli- cations --- p.20 === Chapter 2.2.2 --- Model of environment --- p.23 === Chapter 2.2.3 --- End-point detection (EPD) --- p.23 === Chapter 2.2.4 --- Spectral subtraction (SS) --- p.25 === Chapter 3 --- HMM-based Acoustic Modeling --- p.26 === Chapter 3.1 --- HMMs for ASR --- p.26 === Chapter 3.2 --- Output probabilities --- p.27 === Chapter 3.3 --- Viterbi search engine --- p.29 === Chapter 3.4 --- Isolated word recognition (IWR) & Connected word recognition (CWR) --- p.30 === Chapter 3.4.1 --- Isolated word recognition --- p.30 === Chapter 3.4.2 --- Connected word recognition (CWR) --- p.31 === Chapter 4 --- DSP for embedded applications --- p.32 === Chapter 4.1 --- "Classification of embedded systems (DSP, ASIC, FPGA, etc.)" --- p.32 === Chapter 4.2 --- Description of hardware platform --- p.34 === Chapter 4.3 --- I/O operation for real-time processing --- p.36 === Chapter 4.4 --- Fixed point algorithm on DSP --- p.40 === Chapter 5 --- ASR algorithm optimization --- p.42 === Chapter 5.1 --- Methodology --- p.42 === Chapter 5.2 --- Floating-point to fixed-point conversion --- p.43 === Chapter 5.3 --- Computational complexity consideration --- p.45 === Chapter 5.3.1 --- Feature extraction techniques --- p.45 === Chapter 5.3.2 --- Viterbi search module --- p.50 === Chapter 5.4 --- Memory requirements consideration --- p.51 === Chapter 6 --- Experimental results and performance analysis --- p.53 === Chapter 6.1 --- Cantonese isolated word recognition (IWR) --- p.54 === Chapter 6.1.1 --- Execution time --- p.54 === Chapter 6.1.2 --- Memory requirements --- p.57 === Chapter 6.1.3 --- Recognition performance --- p.57 === Chapter 6.2 --- Connected word recognition (CWR) --- p.61 === Chapter 6.2.1 --- Execution time consideration --- p.62 === Chapter 6.2.2 --- Recognition performance --- p.62 === Chapter 6.3 --- Summary & discussion --- p.66 === Chapter 7 --- Implementation of practical techniques --- p.67 === Chapter 7.1 --- End-point detection (EPD) --- p.67 === Chapter 7.2 --- Spectral subtraction (SS) --- p.71 === Chapter 7.3 --- Experimental results --- p.72 === Chapter 7.3.1 --- Isolated word recognition (IWR) --- p.72 === Chapter 7.3.2 --- Connected word recognition (CWR) --- p.75 === Chapter 7.4 --- Results --- p.77 === Chapter 8 --- Conclusions and future work --- p.78 === Chapter 8.1 --- Summary and Conclusions --- p.78 === Chapter 8.2 --- Suggestions for future research --- p.80 === Appendices --- p.82 === Chapter A --- "Interpolation of data entries without floating point, divides or conditional branches" --- p.82 === Chapter B --- Vocabulary for Cantonese isolated word recognition task --- p.84 === Bibliography --- p.85 |
author2 |
Yuan, Meng. |
author_facet |
Yuan, Meng. |
title |
Speech recognition on DSP: algorithm optimization and performance analysis. |
title_short |
Speech recognition on DSP: algorithm optimization and performance analysis. |
title_full |
Speech recognition on DSP: algorithm optimization and performance analysis. |
title_fullStr |
Speech recognition on DSP: algorithm optimization and performance analysis. |
title_full_unstemmed |
Speech recognition on DSP: algorithm optimization and performance analysis. |
title_sort |
speech recognition on dsp: algorithm optimization and performance analysis. |
publishDate |
2004 |
url |
http://library.cuhk.edu.hk/record=b5892165 http://repository.lib.cuhk.edu.hk/en/item/cuhk-324923 |
_version_ |
1718990120156135424 |
spelling |
ndltd-cuhk.edu.hk-oai-cuhk-dr-cuhk_3249232019-03-05T03:32:48Z Speech recognition on DSP: algorithm optimization and performance analysis. Automatic speech recognition Signal processing--Digital techniques Yuan Meng. Thesis (M.Phil.)--Chinese University of Hong Kong, 2004. Includes bibliographical references (leaves 85-91). Abstracts in English and Chinese. Chapter 1 --- Introduction --- p.1 Chapter 1.1 --- History of ASR development --- p.2 Chapter 1.2 --- Fundamentals of automatic speech recognition --- p.3 Chapter 1.2.1 --- Classification of ASR systems --- p.3 Chapter 1.2.2 --- Automatic speech recognition process --- p.4 Chapter 1.3 --- Performance measurements of ASR --- p.7 Chapter 1.3.1 --- Recognition accuracy --- p.7 Chapter 1.3.2 --- Complexity --- p.7 Chapter 1.3.3 --- Robustness --- p.8 Chapter 1.4 --- Motivation and goal of this work --- p.8 Chapter 1.5 --- Thesis outline --- p.10 Chapter 2 --- Signal processing techniques for front-end --- p.12 Chapter 2.1 --- Basic feature extraction principles --- p.13 Chapter 2.1.1 --- Pre-emphasis --- p.13 Chapter 2.1.2 --- Frame blocking and windowing --- p.13 Chapter 2.1.3 --- Discrete Fourier Transform (DFT) computation --- p.15 Chapter 2.1.4 --- Spectral magnitudes --- p.15 Chapter 2.1.5 --- Mel-frequency filterbank --- p.16 Chapter 2.1.6 --- Logarithm of filter energies --- p.18 Chapter 2.1.7 --- Discrete Cosine Transformation (DCT) --- p.18 Chapter 2.1.8 --- Cepstral Weighting --- p.19 Chapter 2.1.9 --- Dynamic featuring --- p.19 Chapter 2.2 --- Practical issues --- p.20 Chapter 2.2.1 --- Review of practical problems and solutions in ASR appli- cations --- p.20 Chapter 2.2.2 --- Model of environment --- p.23 Chapter 2.2.3 --- End-point detection (EPD) --- p.23 Chapter 2.2.4 --- Spectral subtraction (SS) --- p.25 Chapter 3 --- HMM-based Acoustic Modeling --- p.26 Chapter 3.1 --- HMMs for ASR --- p.26 Chapter 3.2 --- Output probabilities --- p.27 Chapter 3.3 --- Viterbi search engine --- p.29 Chapter 3.4 --- Isolated word recognition (IWR) & Connected word recognition (CWR) --- p.30 Chapter 3.4.1 --- Isolated word recognition --- p.30 Chapter 3.4.2 --- Connected word recognition (CWR) --- p.31 Chapter 4 --- DSP for embedded applications --- p.32 Chapter 4.1 --- "Classification of embedded systems (DSP, ASIC, FPGA, etc.)" --- p.32 Chapter 4.2 --- Description of hardware platform --- p.34 Chapter 4.3 --- I/O operation for real-time processing --- p.36 Chapter 4.4 --- Fixed point algorithm on DSP --- p.40 Chapter 5 --- ASR algorithm optimization --- p.42 Chapter 5.1 --- Methodology --- p.42 Chapter 5.2 --- Floating-point to fixed-point conversion --- p.43 Chapter 5.3 --- Computational complexity consideration --- p.45 Chapter 5.3.1 --- Feature extraction techniques --- p.45 Chapter 5.3.2 --- Viterbi search module --- p.50 Chapter 5.4 --- Memory requirements consideration --- p.51 Chapter 6 --- Experimental results and performance analysis --- p.53 Chapter 6.1 --- Cantonese isolated word recognition (IWR) --- p.54 Chapter 6.1.1 --- Execution time --- p.54 Chapter 6.1.2 --- Memory requirements --- p.57 Chapter 6.1.3 --- Recognition performance --- p.57 Chapter 6.2 --- Connected word recognition (CWR) --- p.61 Chapter 6.2.1 --- Execution time consideration --- p.62 Chapter 6.2.2 --- Recognition performance --- p.62 Chapter 6.3 --- Summary & discussion --- p.66 Chapter 7 --- Implementation of practical techniques --- p.67 Chapter 7.1 --- End-point detection (EPD) --- p.67 Chapter 7.2 --- Spectral subtraction (SS) --- p.71 Chapter 7.3 --- Experimental results --- p.72 Chapter 7.3.1 --- Isolated word recognition (IWR) --- p.72 Chapter 7.3.2 --- Connected word recognition (CWR) --- p.75 Chapter 7.4 --- Results --- p.77 Chapter 8 --- Conclusions and future work --- p.78 Chapter 8.1 --- Summary and Conclusions --- p.78 Chapter 8.2 --- Suggestions for future research --- p.80 Appendices --- p.82 Chapter A --- "Interpolation of data entries without floating point, divides or conditional branches" --- p.82 Chapter B --- Vocabulary for Cantonese isolated word recognition task --- p.84 Bibliography --- p.85 Yuan, Meng. Chinese University of Hong Kong Graduate School. Division of Electronic Engineering. 2004 Text bibliography print xi, 91 leaves : ill. ; 30 cm. cuhk:324923 http://library.cuhk.edu.hk/record=b5892165 eng chi Use of this resource is governed by the terms and conditions of the Creative Commons “Attribution-NonCommercial-NoDerivatives 4.0 International” License (http://creativecommons.org/licenses/by-nc-nd/4.0/) http://repository.lib.cuhk.edu.hk/en/islandora/object/cuhk%3A324923/datastream/TN/view/Speech%20recognition%20on%20DSP%20%3A%20algorithm%20optimization%20and%20performance%20analysis.jpghttp://repository.lib.cuhk.edu.hk/en/item/cuhk-324923 |