A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM)...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/28065850188969895626 |
id |
ndltd-TW-100NCTU5435115 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCTU54351152016-03-28T04:20:39Z http://ndltd.ncl.edu.tw/handle/28065850188969895626 A Mandarin Speech Recognition System Using Weighted Finite-State Transducer 使用有限狀態轉換器之漢語語音辨認系統 Lin, Ang-Hsing 林昂星 碩士 國立交通大學 電信工程研究所 100 This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER. The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%. Wang, Yih-Ru 王逸如 2012 學位論文 ; thesis 48 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER.
The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%.
|
author2 |
Wang, Yih-Ru |
author_facet |
Wang, Yih-Ru Lin, Ang-Hsing 林昂星 |
author |
Lin, Ang-Hsing 林昂星 |
spellingShingle |
Lin, Ang-Hsing 林昂星 A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
author_sort |
Lin, Ang-Hsing |
title |
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
title_short |
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
title_full |
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
title_fullStr |
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
title_full_unstemmed |
A Mandarin Speech Recognition System Using Weighted Finite-State Transducer |
title_sort |
mandarin speech recognition system using weighted finite-state transducer |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/28065850188969895626 |
work_keys_str_mv |
AT linanghsing amandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT línángxīng amandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT linanghsing shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng AT línángxīng shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng AT linanghsing mandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT línángxīng mandarinspeechrecognitionsystemusingweightedfinitestatetransducer |
_version_ |
1718213524326449152 |