Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition

碩士 === 國立暨南國際大學 === 電機工程學系 === 99 === This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect. The first part...

Full description

Bibliographic Details
Main Authors: Pei-Shiuan Jiang, 江佩璇
Other Authors: Gin-Der Wu
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/37299647894176686493
id ndltd-TW-099NCNU0442116
record_format oai_dc
spelling ndltd-TW-099NCNU04421162015-10-28T04:07:08Z http://ndltd.ncl.edu.tw/handle/37299647894176686493 Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition 三重混合模型和倒頻譜正規化技術之強健語音辨識 Pei-Shiuan Jiang 江佩璇 碩士 國立暨南國際大學 電機工程學系 99 This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect. The first part of thesis combines Linear Discriminant Analysis,(LDA), Principal Comonents Analysis,(PCA), and Minimum Classification Error,(MCE) to train model and reach robust the speech feature. The second part of thesis uses cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN) as basic. And normalization techniques use the characteristic parameter which is speech-only to construct codebook. Normalization techniques give weight for every codebook. Then codebook and utterance information are combined in cepstral statistics for normalization. And template matching employs Gaussian Mixture Model, (GMM). Finally, this way compares and discusses the results which are tested in several variable background noises form different conditions. Gin-Der Wu 吳俊德 2011 學位論文 ; thesis 44 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立暨南國際大學 === 電機工程學系 === 99 === This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect. The first part of thesis combines Linear Discriminant Analysis,(LDA), Principal Comonents Analysis,(PCA), and Minimum Classification Error,(MCE) to train model and reach robust the speech feature. The second part of thesis uses cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN) as basic. And normalization techniques use the characteristic parameter which is speech-only to construct codebook. Normalization techniques give weight for every codebook. Then codebook and utterance information are combined in cepstral statistics for normalization. And template matching employs Gaussian Mixture Model, (GMM). Finally, this way compares and discusses the results which are tested in several variable background noises form different conditions.
author2 Gin-Der Wu
author_facet Gin-Der Wu
Pei-Shiuan Jiang
江佩璇
author Pei-Shiuan Jiang
江佩璇
spellingShingle Pei-Shiuan Jiang
江佩璇
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
author_sort Pei-Shiuan Jiang
title Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
title_short Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
title_full Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
title_fullStr Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
title_full_unstemmed Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
title_sort triple hybrid model and cepstral statistics normalization techniques for robust speech recognition
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/37299647894176686493
work_keys_str_mv AT peishiuanjiang triplehybridmodelandcepstralstatisticsnormalizationtechniquesforrobustspeechrecognition
AT jiāngpèixuán triplehybridmodelandcepstralstatisticsnormalizationtechniquesforrobustspeechrecognition
AT peishiuanjiang sānzhònghùnhémóxínghédàopínpǔzhèngguīhuàjìshùzhīqiángjiànyǔyīnbiànshí
AT jiāngpèixuán sānzhònghùnhémóxínghédàopínpǔzhèngguīhuàjìshùzhīqiángjiànyǔyīnbiànshí
_version_ 1718113499014496256