Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method

碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition m...

Full description

Bibliographic Details
Main Authors: Guo-JiWu, 吳國吉
Other Authors: Jhing-Fa Wang
Format: Others
Language:en_US
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/fx625w
id ndltd-TW-102NCKU5442179
record_format oai_dc
spelling ndltd-TW-102NCKU54421792019-05-15T21:42:46Z http://ndltd.ncl.edu.tw/handle/fx625w Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method 具二元對分分裂法之低成本語者及語音辨識系統晶片設計 Guo-JiWu 吳國吉 碩士 國立成功大學 電機工程學系 102 This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module. LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design. This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz. Jhing-Fa Wang 王駿發 2014 學位論文 ; thesis 66 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module. LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design. This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz.
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Guo-JiWu
吳國吉
author Guo-JiWu
吳國吉
spellingShingle Guo-JiWu
吳國吉
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
author_sort Guo-JiWu
title Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_short Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_full Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_fullStr Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_full_unstemmed Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
title_sort low cost chip design for automatic speaker and speech recognition system using binary halved clustering method
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/fx625w
work_keys_str_mv AT guojiwu lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod
AT wúguójí lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod
AT guojiwu jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì
AT wúguójí jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì
_version_ 1719119182950301696