Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System

碩士 === 國立臺北科技大學 === 自動化科技研究所 === 93 === The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded s...

Full description

Bibliographic Details
Main Authors: Chen-Chih Huang, 黃禎智
Other Authors: 蔡孟伸
Format: Others
Language:zh-TW
Published: 2005
Online Access:http://ndltd.ncl.edu.tw/handle/6s5qgd
id ndltd-TW-093TIT05146010
record_format oai_dc
spelling ndltd-TW-093TIT051460102019-05-31T03:35:54Z http://ndltd.ncl.edu.tw/handle/6s5qgd Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System 基於語者模型梅爾倒頻譜係數與類神經網路之主從式即時語者辨識系統 Chen-Chih Huang 黃禎智 碩士 國立臺北科技大學 自動化科技研究所 93 The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded system platform to perform the speaker recognition function. Due to the limitations of computing capability and memory of embedded systems, the features extracted from speaker model are reduced. In order to overcome the computation limitation, a client - server architecture is proposed in this thesis. In this architecture, the server deals with the Neural Network training process that requires a great deal of computation, while the client performs the real-time speaker recognition based on the updated weights of neural network which is retrieved from the server. The experimental results show that the average recognition rate of this system is more than 90% and the recognition time is less than 3 seconds. The proposed speaker recognition system can be generally applied to home security, office security, factory security systems, etc. 蔡孟伸 2005 學位論文 ; thesis 75 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺北科技大學 === 自動化科技研究所 === 93 === The main contribution of this thesis is to develop a real-time speaker recognition system with Speaker Model Mel-Frequency Cepstral Coefficients (SMMFCC) derived from Fast Fourier Transform (FFT). Back-Propagation Neural Network is used on ARM-based embedded system platform to perform the speaker recognition function. Due to the limitations of computing capability and memory of embedded systems, the features extracted from speaker model are reduced. In order to overcome the computation limitation, a client - server architecture is proposed in this thesis. In this architecture, the server deals with the Neural Network training process that requires a great deal of computation, while the client performs the real-time speaker recognition based on the updated weights of neural network which is retrieved from the server. The experimental results show that the average recognition rate of this system is more than 90% and the recognition time is less than 3 seconds. The proposed speaker recognition system can be generally applied to home security, office security, factory security systems, etc.
author2 蔡孟伸
author_facet 蔡孟伸
Chen-Chih Huang
黃禎智
author Chen-Chih Huang
黃禎智
spellingShingle Chen-Chih Huang
黃禎智
Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
author_sort Chen-Chih Huang
title Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
title_short Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
title_full Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
title_fullStr Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
title_full_unstemmed Development of SMMFCC and ANN based Client-Server Real-Time Speaker Recognition System
title_sort development of smmfcc and ann based client-server real-time speaker recognition system
publishDate 2005
url http://ndltd.ncl.edu.tw/handle/6s5qgd
work_keys_str_mv AT chenchihhuang developmentofsmmfccandannbasedclientserverrealtimespeakerrecognitionsystem
AT huángzhēnzhì developmentofsmmfccandannbasedclientserverrealtimespeakerrecognitionsystem
AT chenchihhuang jīyúyǔzhěmóxíngméiěrdàopínpǔxìshùyǔlèishénjīngwǎnglùzhīzhǔcóngshìjíshíyǔzhěbiànshíxìtǒng
AT huángzhēnzhì jīyúyǔzhěmóxíngméiěrdàopínpǔxìshùyǔlèishénjīngwǎnglùzhīzhǔcóngshìjíshíyǔzhěbiànshíxìtǒng
_version_ 1719197120082214912