VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features

碩士 === 國立成功大學 === 電機工程學系碩博士班 === 91 === In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem...

Full description

Bibliographic Details
Main Authors:	Tze-Hsuan Huang, 黃子軒
Other Authors:	Jhing-Fa Wang
Format:	Others
Language:	en_US
Published:	2003
Online Access:	http://ndltd.ncl.edu.tw/handle/50906995482258528290

id	ndltd-TW-091NCKU5442211
record_format	oai_dc
spelling	ndltd-TW-091NCKU54422112016-06-22T04:14:02Z http://ndltd.ncl.edu.tw/handle/50906995482258528290 VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features 以MPEG-7特徵為基礎的居家環境聲音辨識器之超大型積體電路架構設計 Tze-Hsuan Huang 黃子軒碩士國立成功大學電機工程學系碩博士班 91 In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem where the parameter is not generalized [2~5]. The HMM based sound recognizer has been introduced by [8] to resolve this drawback. However, it adopts spectrum parameter and will result in high dimensional feature vectors. This thesis successfully solves the shortcoming by taking the basis extraction. The recognition rate is about 82% while only spectrogram is adopted as the parameter. The improved recognition rate is about 95% while above three mentioned MPEG-7 audio features are regarded as the parameters in our environmental sound recognizer. Moreover, related VLSI architectures for this sound recognition system are also proposed. The first one is the feature extraction module. The most complicated computations in the module are the division and nth-root operations. We utilize the CORDIC method to devise a divider. For the nth-root operation, a specific circuit is designed in accordance with the Brahmagupta iteration algorithm. For the Viterbi algorithm, a dedicated hardware architecture is also presented. This architecture is designed based on the 4-step fully Viterbi algorithm. This speed-up of this module is also ascribed to the fully pipeline systolic array architecture. Jhing-Fa Wang 王駿發 2003 學位論文 ; thesis 57 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立成功大學 === 電機工程學系碩博士班 === 91 === In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem where the parameter is not generalized [2~5]. The HMM based sound recognizer has been introduced by [8] to resolve this drawback. However, it adopts spectrum parameter and will result in high dimensional feature vectors. This thesis successfully solves the shortcoming by taking the basis extraction. The recognition rate is about 82% while only spectrogram is adopted as the parameter. The improved recognition rate is about 95% while above three mentioned MPEG-7 audio features are regarded as the parameters in our environmental sound recognizer. Moreover, related VLSI architectures for this sound recognition system are also proposed. The first one is the feature extraction module. The most complicated computations in the module are the division and nth-root operations. We utilize the CORDIC method to devise a divider. For the nth-root operation, a specific circuit is designed in accordance with the Brahmagupta iteration algorithm. For the Viterbi algorithm, a dedicated hardware architecture is also presented. This architecture is designed based on the 4-step fully Viterbi algorithm. This speed-up of this module is also ascribed to the fully pipeline systolic array architecture.
author2	Jhing-Fa Wang
author_facet	Jhing-Fa Wang Tze-Hsuan Huang 黃子軒
author	Tze-Hsuan Huang 黃子軒
spellingShingle	Tze-Hsuan Huang 黃子軒 VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
author_sort	Tze-Hsuan Huang
title	VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
title_short	VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
title_full	VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
title_fullStr	VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
title_full_unstemmed	VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
title_sort	vlsi architectures for home environmentalsound recognition based on mpeg-7 features
publishDate	2003
url	http://ndltd.ncl.edu.tw/handle/50906995482258528290
work_keys_str_mv	AT tzehsuanhuang vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features AT huángzixuān vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features AT tzehsuanhuang yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì AT huángzixuān yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì
_version_	1718314394335576064

VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features

Similar Items