DCT-based Processing of Dynamic Features for Robust Speech Recognition

Master's thesis === National Chi Nan University === Department of Electrical Engineering === 98 === In this thesis, we explore the properties of cepstral time coefficients (CTC) in speech recognition and then propose several methods to refine the CTC construction process. It is found that CTC are the filtered version of mel-frequency cepstral coeffici...


Bibliographic Details
Main Authors:	Wen-chi Lin, 林文琦
Other Authors:	Jeih-weih Hung
Format:	Others
Language:	en_US
Published:	2010
Online Access:	http://ndltd.ncl.edu.tw/handle/32454204118543401202

id	ndltd-TW-098NCNU0442021
record_format	oai_dc
spelling	ndltd-TW-098NCNU04420212015-10-13T18:16:15Z http://ndltd.ncl.edu.tw/handle/32454204118543401202 DCT-based Processing of Dynamic Features for Robust Speech Recognition 使用離散餘弦轉換處理動態特徵之強健性語音辨認 Wen-chi Lin 林文琦 Master's thesis National Chi Nan University Department of Electrical Engineering 98 In this thesis, we explore the properties of cepstral time coefficients (CTC) in speech recognition and propose several methods to refine the CTC construction process. We find that CTC are filtered versions of mel-frequency cepstral coefficients (MFCC), where the filters are taken from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing the DC gain, and varying the filter length. Speech recognition experiments on the Aurora-2 digit database show that the proposed methods improve recognition accuracy over the original CTC, with a relative error reduction of around 20%. Jeih-weih Hung 洪志偉 2010 thesis 44 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	Master's thesis === National Chi Nan University === Department of Electrical Engineering === 98 === In this thesis, we explore the properties of cepstral time coefficients (CTC) in speech recognition and propose several methods to refine the CTC construction process. We find that CTC are filtered versions of mel-frequency cepstral coefficients (MFCC), where the filters are taken from the discrete cosine transform (DCT) matrix. We modify these DCT-based filters by windowing, removing the DC gain, and varying the filter length. Speech recognition experiments on the Aurora-2 digit database show that the proposed methods improve recognition accuracy over the original CTC, with a relative error reduction of around 20%.
author2	Jeih-weih Hung
author_facet	Jeih-weih Hung Wen-chi Lin 林文琦
author	Wen-chi Lin 林文琦
spellingShingle	Wen-chi Lin 林文琦 DCT-based Processing of Dynamic Features for Robust Speech Recognition
author_sort	Wen-chi Lin
title	DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_short	DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_full	DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_fullStr	DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_full_unstemmed	DCT-based Processing of Dynamic Features for Robust Speech Recognition
title_sort	dct-based processing of dynamic features for robust speech recognition
publishDate	2010
url	http://ndltd.ncl.edu.tw/handle/32454204118543401202
work_keys_str_mv	AT wenchilin dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition AT línwénqí dctbasedprocessingofdynamicfeaturesforrobustspeechrecognition AT wenchilin shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn AT línwénqí shǐyònglísànyúxiánzhuǎnhuànchùlǐdòngtàitèzhēngzhīqiángjiànxìngyǔyīnbiànrèn
_version_	1718029430210691072
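
The abstract describes CTC as MFCC trajectories filtered along time by rows of the DCT matrix, refined by windowing, DC-gain removal, and a variable filter length. The Python sketch below illustrates that idea only; it is not the thesis's implementation. The DCT-II basis, the Hamming window, the mean-subtraction step, and all function names (`dct_filter`, `refine_filter`, `apply_temporal_filter`) are assumptions chosen for illustration, and the MFCC input is random stand-in data.

```python
import numpy as np


def dct_filter(order, length):
    """One row of an (unnormalized) DCT-II matrix, used as an FIR filter.

    Assumption: the thesis may use a different DCT type or scaling.
    """
    n = np.arange(length)
    return np.cos(np.pi * order * (2 * n + 1) / (2 * length))


def refine_filter(h, window=True, remove_dc=True):
    """Optionally taper the filter and force its DC gain (sum of taps) to zero."""
    h = np.asarray(h, dtype=float).copy()
    if window:
        # Hamming is an assumed window choice, not taken from the source.
        h = h * np.hamming(len(h))
    if remove_dc:
        # Subtracting the mean makes the taps sum to zero,
        # so a constant input is mapped to zero.
        h = h - h.mean()
    return h


def apply_temporal_filter(features, h):
    """Filter each feature dimension along the time (frame) axis.

    features: array of shape (num_frames, num_coeffs).
    np.convolve flips the filter; for the symmetric or antisymmetric
    filters built here that changes the output sign at most.
    """
    return np.stack(
        [np.convolve(features[:, d], h, mode="same")
         for d in range(features.shape[1])],
        axis=1,
    )


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    mfcc = rng.standard_normal((100, 13))  # stand-in for real MFCC frames

    # Varying the filter length is one of the refinements named in the abstract.
    for length in (5, 9, 13):
        h = refine_filter(dct_filter(order=2, length=length))
        dynamic = apply_temporal_filter(mfcc, h)
        print(length, dynamic.shape, float(abs(h.sum())))
```

In this sketch the unwindowed DCT-II rows of order 1 and above already sum to zero; windowing an even-order (symmetric) row reintroduces a nonzero DC gain, which is why the mean is subtracted after the window is applied.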
