DCT-based Processing of Dynamic Features for Robust Speech Recognition

碩士 === 國立暨南國際大學 === 電機工程學系 === 98 === In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coeffici...

Full description

Bibliographic Details
Main Authors: Wen-chi Lin, 林文琦
Other Authors: Jeih-weih Hung
Format: Others
Language:en_US
Published: 2010
Online Access:http://ndltd.ncl.edu.tw/handle/32454204118543401202
id ndltd-TW-098NCNU0442021
record_format oai_dc
spelling ndltd-TW-098NCNU04420212015-10-13T18:16:15Z http://ndltd.ncl.edu.tw/handle/32454204118543401202 DCT-based Processing of Dynamic Features for Robust Speech Recognition 使用離散餘弦轉換處理動態特徵之強健性語音辨認 Wen-chi Lin 林文琦 碩士 國立暨南國際大學 電機工程學系 98 In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coefficients (MFCC), and the used filters are from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing DC gain, and varying the filter length. The speech recognition task using Aurora-2 digit database show that the proposed methods can enhance the original CTC in improving the recognition accuracy. The resulting relative error reduction is around 20%. Jeih-weih Hung 洪志偉 2010 學位論文 ; thesis 44 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立暨南國際大學 === 電機工程學系 === 98 === In this thesis, we explore the various properties of cepstral time coefficients (CTC) in speech recognition, and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coefficients (MFCC), and the used filters are from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing DC gain, and varying the filter length. The speech recognition task using Aurora-2 digit database show that the proposed methods can enhance the original CTC in improving the recognition accuracy. The resulting relative error reduction is around 20%.
author2 Jeih-weih Hung
author_facet Jeih-weih Hung
Wen-chi Lin
林文琦
author Wen-chi Lin
林文琦
spellingShingle Wen-chi Lin
林文琦
DCT-based Processing of Dynamic Features for Robust Speech Recognition
author_sort Wen-chi Lin
title DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_short DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_full DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_fullStr DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_full_unstemmed DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_sort dct-based processing of dynamic features for robust speech recognition
publishDate 2010
url http://ndltd.ncl.edu.tw/handle/32454204118543401202
work_keys_str_mv AT wenchilin dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition
AT línwénqí dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition
AT wenchilin shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn
AT línwénqí shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn
_version_ 1718029430210691072