Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition

碩士 === 元智大學 === 通訊工程學系 === 99 === This thesis presents the robust training techniques for hidden conditional random fiels (HCRF)-based acoustic modeling of Mandarin/English mixed-lingual speech recognition. Two issues were dealt with: (1) mixed-lingual speech recognition against with noise effects a...

Full description

Bibliographic Details
Main Authors: Shih-Han Huang, 黃詩涵
Other Authors: 洪維廷
Format: Others
Language:zh-TW
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/66507400162186510812
id ndltd-TW-099YZU05650046
record_format oai_dc
spelling ndltd-TW-099YZU056500462016-04-13T04:17:17Z http://ndltd.ncl.edu.tw/handle/66507400162186510812 Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition 基於隱藏式條件隨機域聲學模型之強健式華英混雜語音辨認演算法 Shih-Han Huang 黃詩涵 碩士 元智大學 通訊工程學系 99 This thesis presents the robust training techniques for hidden conditional random fiels (HCRF)-based acoustic modeling of Mandarin/English mixed-lingual speech recognition. Two issues were dealt with: (1) mixed-lingual speech recognition against with noise effects and (2) cross-lingual errors in mixed-lingual speech recognition. We solved first issue with the REST algorithm and reduce the errors in second issue with a discriminative training algorithm combined by the REST algorithm(D-REST). The experimental results indicate that 16.4% averaged error rate reduction by the HCRF-based framework is achieved under ROVER_2 noise environment compared with the result by the traditional HMM approach. In additional, the cross-lingual error is improved significantly with the HCRF-framework in mixed-lingual speech recognition. 洪維廷 2011 學位論文 ; thesis 77 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元智大學 === 通訊工程學系 === 99 === This thesis presents the robust training techniques for hidden conditional random fiels (HCRF)-based acoustic modeling of Mandarin/English mixed-lingual speech recognition. Two issues were dealt with: (1) mixed-lingual speech recognition against with noise effects and (2) cross-lingual errors in mixed-lingual speech recognition. We solved first issue with the REST algorithm and reduce the errors in second issue with a discriminative training algorithm combined by the REST algorithm(D-REST). The experimental results indicate that 16.4% averaged error rate reduction by the HCRF-based framework is achieved under ROVER_2 noise environment compared with the result by the traditional HMM approach. In additional, the cross-lingual error is improved significantly with the HCRF-framework in mixed-lingual speech recognition.
author2 洪維廷
author_facet 洪維廷
Shih-Han Huang
黃詩涵
author Shih-Han Huang
黃詩涵
spellingShingle Shih-Han Huang
黃詩涵
Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
author_sort Shih-Han Huang
title Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
title_short Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
title_full Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
title_fullStr Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
title_full_unstemmed Mixed-Lingual Acoustic Modeling of Hidden Conditional Random Field for Robust Speech Recognition
title_sort mixed-lingual acoustic modeling of hidden conditional random field for robust speech recognition
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/66507400162186510812
work_keys_str_mv AT shihhanhuang mixedlingualacousticmodelingofhiddenconditionalrandomfieldforrobustspeechrecognition
AT huángshīhán mixedlingualacousticmodelingofhiddenconditionalrandomfieldforrobustspeechrecognition
AT shihhanhuang jīyúyǐncángshìtiáojiànsuíjīyùshēngxuémóxíngzhīqiángjiànshìhuáyīnghùnzáyǔyīnbiànrènyǎnsuànfǎ
AT huángshīhán jīyúyǐncángshìtiáojiànsuíjīyùshēngxuémóxíngzhīqiángjiànshìhuáyīnghùnzáyǔyīnbiànrènyǎnsuànfǎ
_version_ 1718222776583585792