Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition
碩士 === 國立暨南國際大學 === 電機工程學系 === 99 === This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect. The first part...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/37299647894176686493 |
id |
ndltd-TW-099NCNU0442116 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099NCNU04421162015-10-28T04:07:08Z http://ndltd.ncl.edu.tw/handle/37299647894176686493 Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition 三重混合模型和倒頻譜正規化技術之強健語音辨識 Pei-Shiuan Jiang 江佩璇 碩士 國立暨南國際大學 電機工程學系 99 This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect. The first part of thesis combines Linear Discriminant Analysis,(LDA), Principal Comonents Analysis,(PCA), and Minimum Classification Error,(MCE) to train model and reach robust the speech feature. The second part of thesis uses cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN) as basic. And normalization techniques use the characteristic parameter which is speech-only to construct codebook. Normalization techniques give weight for every codebook. Then codebook and utterance information are combined in cepstral statistics for normalization. And template matching employs Gaussian Mixture Model, (GMM). Finally, this way compares and discusses the results which are tested in several variable background noises form different conditions. Gin-Der Wu 吳俊德 2011 學位論文 ; thesis 44 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立暨南國際大學 === 電機工程學系 === 99 === This thesis is investigated in two ways to reach enhance the recognition rates respectively, one is to combine different modes which have different advantages, and the other is cepstral statistics normalization techniques to reduce noise effect.
The first part of thesis combines Linear Discriminant Analysis,(LDA), Principal Comonents Analysis,(PCA), and Minimum Classification Error,(MCE) to train model and reach robust the speech feature.
The second part of thesis uses cepstral mean subtraction (CMS), cepstral mean and variance normalization (CMVN) as basic. And normalization techniques use the characteristic parameter which is speech-only to construct codebook. Normalization techniques give weight for every codebook. Then codebook and utterance information are combined in cepstral statistics for normalization. And template matching employs Gaussian Mixture Model, (GMM). Finally, this way compares and discusses the results which are tested in several variable background noises form different conditions.
|
author2 |
Gin-Der Wu |
author_facet |
Gin-Der Wu Pei-Shiuan Jiang 江佩璇 |
author |
Pei-Shiuan Jiang 江佩璇 |
spellingShingle |
Pei-Shiuan Jiang 江佩璇 Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
author_sort |
Pei-Shiuan Jiang |
title |
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
title_short |
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
title_full |
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
title_fullStr |
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
title_full_unstemmed |
Triple Hybrid Model And Cepstral Statistics Normalization Techniques For Robust Speech Recognition |
title_sort |
triple hybrid model and cepstral statistics normalization techniques for robust speech recognition |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/37299647894176686493 |
work_keys_str_mv |
AT peishiuanjiang triplehybridmodelandcepstralstatisticsnormalizationtechniquesforrobustspeechrecognition AT jiāngpèixuán triplehybridmodelandcepstralstatisticsnormalizationtechniquesforrobustspeechrecognition AT peishiuanjiang sānzhònghùnhémóxínghédàopínpǔzhèngguīhuàjìshùzhīqiángjiànyǔyīnbiànshí AT jiāngpèixuán sānzhònghùnhémóxínghédàopínpǔzhèngguīhuàjìshùzhīqiángjiànyǔyīnbiànshí |
_version_ |
1718113499014496256 |