DCT-based Processing of Dynamic Features for Robust Speech Recognition
碩士 === 國立暨南國際大學 === 電機工程學系 === 98 === In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coeffici...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/32454204118543401202 |
id |
ndltd-TW-098NCNU0442021 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098NCNU04420212015-10-13T18:16:15Z http://ndltd.ncl.edu.tw/handle/32454204118543401202 DCT-based Processing of Dynamic Features for Robust Speech Recognition 使用離散餘弦轉換處理動態特徵之強健性語音辨認 Wen-chi Lin 林文琦 碩士 國立暨南國際大學 電機工程學系 98 In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coefficients (MFCC), and the used filters are from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing DC gain, and varying the filter length. The speech recognition task using Aurora-2 digit database show that the proposed methods can enhance the original CTC in improving the recognition accuracy. The resulting relative error reduction is around 20%. Jeih-weih Hung 洪志偉 2010 學位論文 ; thesis 44 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立暨南國際大學 === 電機工程學系 === 98 === In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coefficients (MFCC), and the used filters are from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing DC gain, and varying the filter length. The speech recognition task using Aurora-2 digit database show that the proposed methods can enhance the original CTC in improving the recognition accuracy. The resulting relative error reduction is around 20%.
|
author2 |
Jeih-weih Hung |
author_facet |
Jeih-weih Hung Wen-chi Lin 林文琦 |
author |
Wen-chi Lin 林文琦 |
spellingShingle |
Wen-chi Lin 林文琦 DCT-based Processing of Dynamic Features for Robust Speech Recognition |
author_sort |
Wen-chi Lin |
title |
DCT-based Processing of Dynamic Features for Robust Speech Recognition |
title_short |
DCT-based Processing of Dynamic Features for Robust Speech Recognition |
title_full |
DCT-based Processing of Dynamic Features for Robust Speech Recognition |
title_fullStr |
DCT-based Processing of Dynamic Features for Robust Speech Recognition |
title_full_unstemmed |
DCT-based Processing of Dynamic Features for Robust Speech Recognition |
title_sort |
dct-based processing of dynamic features for robust speech recognition |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/32454204118543401202 |
work_keys_str_mv |
AT wenchilin dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition AT línwénqí dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition AT wenchilin shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn AT línwénqí shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn |
_version_ |
1718029430210691072 |