A Mandarin Speech Recognition System Using Weighted Finite-State Transducer

碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM)...

Full description

Bibliographic Details
Main Authors: Lin, Ang-Hsing, 林昂星
Other Authors: Wang, Yih-Ru
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/28065850188969895626
id ndltd-TW-100NCTU5435115
record_format oai_dc
spelling ndltd-TW-100NCTU54351152016-03-28T04:20:39Z http://ndltd.ncl.edu.tw/handle/28065850188969895626 A Mandarin Speech Recognition System Using Weighted Finite-State Transducer 使用有限狀態轉換器之漢語語音辨認系統 Lin, Ang-Hsing 林昂星 碩士 國立交通大學 電信工程研究所 100 This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER. The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%. Wang, Yih-Ru 王逸如 2012 學位論文 ; thesis 48 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER. The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%.
author2 Wang, Yih-Ru
author_facet Wang, Yih-Ru
Lin, Ang-Hsing
林昂星
author Lin, Ang-Hsing
林昂星
spellingShingle Lin, Ang-Hsing
林昂星
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
author_sort Lin, Ang-Hsing
title A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_short A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_full A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_fullStr A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_full_unstemmed A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_sort mandarin speech recognition system using weighted finite-state transducer
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/28065850188969895626
work_keys_str_mv AT linanghsing amandarinspeechrecognitionsystemusingweightedfinitestatetransducer
AT línángxīng amandarinspeechrecognitionsystemusingweightedfinitestatetransducer
AT linanghsing shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng
AT línángxīng shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng
AT linanghsing mandarinspeechrecognitionsystemusingweightedfinitestatetransducer
AT línángxīng mandarinspeechrecognitionsystemusingweightedfinitestatetransducer
_version_ 1718213524326449152