Implementation of ASSR System Based on HMM and Syllable Models on FPGA

碩士 === 國立成功大學 === 電機工程學系 === 104 === Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature e...

Full description

Bibliographic Details
Main Authors:	Wei-XiangLiao, 廖韋翔
Other Authors:	Jhing-Fa Wang
Format:	Others
Language:	en_US
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/7ppebg

id	ndltd-TW-104NCKU5442137
record_format	oai_dc
spelling	ndltd-TW-104NCKU54421372019-05-15T22:54:11Z http://ndltd.ncl.edu.tw/handle/7ppebg Implementation of ASSR System Based on HMM and Syllable Models on FPGA 以FPGA實現基於HMM之音節模型組成之語音辨識系統 Wei-XiangLiao 廖韋翔碩士國立成功大學電機工程學系 104 Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature extraction, 3) speech and speaker recognition and 4) Out-of-Vocabulary (OOV) and Out-of-Speaker (OOS) detection. This study adopts the Mel-frequency cepstral coefficients (MFCCs) as the features for feature extraction module. We use Hidden Markov Model (HMM) to build the acoustic model for each phoneme, and evaluate our approaches on two databases: the THCHS-30 (Tsinghua Chinese 30 hour database) and the CMU ARCTIC Databases. The binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. The last part of ASSR uses the grammar to detect OOV, and the OOS detection algorithm to detect OOS. The experiments are conducted on two types of platforms including PC and Xilinx Spartan-6 FPGA. The experimental results indicate that the proposed work can achieve 90.8% of Mandarin speech recognition and 86.6% of English speech recognition rate, respectively. The work can achieve 88.7% of OOV detection rate of Mandarin and 84.9% of OOV detection rate of English as well. The speaker recognition rate also reaches to 81.3% and OOS detection rate reaches to 80.8%, respectively. Jhing-Fa Wang 王駿發 2016 學位論文 ; thesis 64 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 電機工程學系 === 104 === Hidden Markov Models (HMMs) is one of the most popular methods for modern speech recognition. In this thesis, we propose an Automatic Speech-Speaker Recognition (ASSR) system on a FPGA platform. The ASSR system includes four parts: 1) pre-processing, 2) feature extraction, 3) speech and speaker recognition and 4) Out-of-Vocabulary (OOV) and Out-of-Speaker (OOS) detection. This study adopts the Mel-frequency cepstral coefficients (MFCCs) as the features for feature extraction module. We use Hidden Markov Model (HMM) to build the acoustic model for each phoneme, and evaluate our approaches on two databases: the THCHS-30 (Tsinghua Chinese 30 hour database) and the CMU ARCTIC Databases. The binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. The last part of ASSR uses the grammar to detect OOV, and the OOS detection algorithm to detect OOS. The experiments are conducted on two types of platforms including PC and Xilinx Spartan-6 FPGA. The experimental results indicate that the proposed work can achieve 90.8% of Mandarin speech recognition and 86.6% of English speech recognition rate, respectively. The work can achieve 88.7% of OOV detection rate of Mandarin and 84.9% of OOV detection rate of English as well. The speaker recognition rate also reaches to 81.3% and OOS detection rate reaches to 80.8%, respectively.
author2	Jhing-Fa Wang
author_facet	Jhing-Fa Wang Wei-XiangLiao 廖韋翔
author	Wei-XiangLiao 廖韋翔
spellingShingle	Wei-XiangLiao 廖韋翔 Implementation of ASSR System Based on HMM and Syllable Models on FPGA
author_sort	Wei-XiangLiao
title	Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_short	Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_full	Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_fullStr	Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_full_unstemmed	Implementation of ASSR System Based on HMM and Syllable Models on FPGA
title_sort	implementation of assr system based on hmm and syllable models on fpga
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/7ppebg
work_keys_str_mv	AT weixiangliao implementationofassrsystembasedonhmmandsyllablemodelsonfpga AT liàowéixiáng implementationofassrsystembasedonhmmandsyllablemodelsonfpga AT weixiangliao yǐfpgashíxiànjīyúhmmzhīyīnjiémóxíngzǔchéngzhīyǔyīnbiànshíxìtǒng AT liàowéixiáng yǐfpgashíxiànjīyúhmmzhīyīnjiémóxíngzǔchéngzhīyǔyīnbiànshíxìtǒng
_version_	1719137463583113216

Implementation of ASSR System Based on HMM and Syllable Models on FPGA

Similar Items