Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
碩士 === 國立臺北科技大學 === 自動化科技研究所 === 93 === The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded s...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2005
|
Online Access: | http://ndltd.ncl.edu.tw/handle/6s5qgd |
id |
ndltd-TW-093TIT05146010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-093TIT051460102019-05-31T03:35:54Z http://ndltd.ncl.edu.tw/handle/6s5qgd Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System 基於語者模型梅爾倒頻譜係數與類神經網路之主從式即時語者辨識系統 Chen-Chih Huang 黃禎智 碩士 國立臺北科技大學 自動化科技研究所 93 The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded system platform to perform the speaker recognition function. Due to the limitations of computing capability and memory of embedded systems, the features extracted from speaker model are reduced. In order to overcome the computation limitation, a client - server architecture is proposed in this thesis. In this architecture, the server deals with the Neural Network training process that requires a great deal of computation, while the client performs the real-time speaker recognition based on the updated weights of neural network which is retrieved from the server. The experimental results show that the average recognition rate of this system is more than 90% and the recognition time is less than 3 seconds. The proposed speaker recognition system can be generally applied to home security, office security, factory security systems, etc. 蔡孟伸 2005 學位論文 ; thesis 75 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺北科技大學 === 自動化科技研究所 === 93 === The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded system platform to perform the speaker recognition function. Due to the limitations of computing capability and memory of embedded systems, the features extracted from speaker model are reduced.
In order to overcome the computation limitation, a client - server architecture is proposed in this thesis. In this architecture, the server deals with the Neural Network training process that requires a great deal of computation, while the client performs the real-time speaker recognition based on the updated weights of neural network which is retrieved from the server.
The experimental results show that the average recognition rate of this system is more than 90% and the recognition time is less than 3 seconds. The proposed speaker recognition system can be generally applied to home security, office security, factory security systems, etc.
|
author2 |
蔡孟伸 |
author_facet |
蔡孟伸 Chen-Chih Huang 黃禎智 |
author |
Chen-Chih Huang 黃禎智 |
spellingShingle |
Chen-Chih Huang 黃禎智 Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
author_sort |
Chen-Chih Huang |
title |
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
title_short |
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
title_full |
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
title_fullStr |
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
title_full_unstemmed |
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System |
title_sort |
development of smmfcc and ann based client-server real-time speaker recognition system |
publishDate |
2005 |
url |
http://ndltd.ncl.edu.tw/handle/6s5qgd |
work_keys_str_mv |
AT chenchihhuang developmentofsmmfccandannbasedclientserverrealtimespeakerrecognitionsystem AT huángzhēnzhì developmentofsmmfccandannbasedclientserverrealtimespeakerrecognitionsystem AT chenchihhuang jīyúyǔzhěmóxíngméiěrdàopínpǔxìshùyǔlèishénjīngwǎnglùzhīzhǔcóngshìjíshíyǔzhěbiànshíxìtǒng AT huángzhēnzhì jīyúyǔzhěmóxíngméiěrdàopínpǔxìshùyǔlèishénjīngwǎnglùzhīzhǔcóngshìjíshíyǔzhěbiànshíxìtǒng |
_version_ |
1719197120082214912 |