Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method
碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition m...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/fx625w |
id |
ndltd-TW-102NCKU5442179 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NCKU54421792019-05-15T21:42:46Z http://ndltd.ncl.edu.tw/handle/fx625w Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method 具二元對分分裂法之低成本語者及語音辨識系統晶片設計 Guo-JiWu 吳國吉 碩士 國立成功大學 電機工程學系 102 This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module. LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design. This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz. Jhing-Fa Wang 王駿發 2014 學位論文 ; thesis 66 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 電機工程學系 === 102 === This study proposed a low-cost and fast-trainable chip design for automatic speaker-speech recognition (ASSR) system. There are four parts of this proposed system, which is including: feature extraction module, speaker model training module, speaker recognition module, and speech recognition module.
LPCC (Linear Predictive Cepstral Coefficients) is adopted into the proposed feature extraction module. The speech recognition uses dynamic time warping (DTW) to classify the target speech. The novel binary halved clustering (BHC) method uses binary-halved splitting to generate speaker models for low complexity requirement. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 52% less cost, 68% less responding time, an ASSR accuracy of 90%. This ASSR system to efficiently implement the low cost chip design.
This study has been taped-out in TSMC’s 90nm process. The chip area is 1.47*1.47 mm2, 84-pin package, gate count is 395K, and the power dissipation is 8.74 mW. The operation frequency is 50 MHz, while the Sampling rate is 16 kHz.
|
author2 |
Jhing-Fa Wang |
author_facet |
Jhing-Fa Wang Guo-JiWu 吳國吉 |
author |
Guo-JiWu 吳國吉 |
spellingShingle |
Guo-JiWu 吳國吉 Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
author_sort |
Guo-JiWu |
title |
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
title_short |
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
title_full |
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
title_fullStr |
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
title_full_unstemmed |
Low Cost Chip Design for Automatic Speaker and Speech Recognition System Using Binary Halved Clustering Method |
title_sort |
low cost chip design for automatic speaker and speech recognition system using binary halved clustering method |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/fx625w |
work_keys_str_mv |
AT guojiwu lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod AT wúguójí lowcostchipdesignforautomaticspeakerandspeechrecognitionsystemusingbinaryhalvedclusteringmethod AT guojiwu jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì AT wúguójí jùèryuánduìfēnfēnlièfǎzhīdīchéngběnyǔzhějíyǔyīnbiànshíxìtǒngjīngpiànshèjì |
_version_ |
1719119182950301696 |