Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method

碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition m...

Full description

Bibliographic Details
Main Authors:	Guo-JiWu, 吳國吉
Other Authors:	Jhing-Fa Wang
Format:	Others
Language:	en_US
Published:	2014
Online Access:	http://ndltd.ncl.edu.tw/handle/fx625w

id	ndltd-TW-102NCKU5442179
record_format	oai_dc
spelling	ndltd-TW-102NCKU54421792019-05-15T21:42:46Z http://ndltd.ncl.edu.tw/handle/fx625w Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method 具二元對分分裂法之低成本語者及語音辨識系統晶片設計 Guo-JiWu 吳國吉碩士國立成功大學電機工程學系 102 This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module. LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design. This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz. Jhing-Fa Wang 王駿發 2014 學位論文 ; thesis 66 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module. LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design. This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz.
author2	Jhing-Fa Wang
author_facet	Jhing-Fa Wang Guo-JiWu 吳國吉
author	Guo-JiWu 吳國吉
spellingShingle	Guo-JiWu 吳國吉 Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
author_sort	Guo-JiWu
title	Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_short	Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_full	Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_fullStr	Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_full_unstemmed	Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_sort	low cost chip design for automatic speaker and speech recognition system using binary halved clustering method
publishDate	2014
url	http://ndltd.ncl.edu.tw/handle/fx625w
work_keys_str_mv	AT guojiwu lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod AT wúguójí lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod AT guojiwu jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì AT wúguójí jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì
_version_	1719119182950301696

Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method

Similar Items