VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features
碩士 === 國立成功大學 === 電機工程學系碩博士班 === 91 === In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2003
|
Online Access: | http://ndltd.ncl.edu.tw/handle/50906995482258528290 |
id |
ndltd-TW-091NCKU5442211 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-091NCKU54422112016-06-22T04:14:02Z http://ndltd.ncl.edu.tw/handle/50906995482258528290 VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features 以MPEG-7特徵為基礎的居家環境聲音辨識器之超大型積體電路架構設計 Tze-Hsuan Huang 黃子軒 碩士 國立成功大學 電機工程學系碩博士班 91 In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem where the parameter is not generalized [2~5]. The HMM based sound recognizer has been introduced by [8] to resolve this drawback. However, it adopts spectrum parameter and will result in high dimensional feature vectors. This thesis successfully solves the shortcoming by taking the basis extraction. The recognition rate is about 82% while only spectrogram is adopted as the parameter. The improved recognition rate is about 95% while above three mentioned MPEG-7 audio features are regarded as the parameters in our environmental sound recognizer. Moreover, related VLSI architectures for this sound recognition system are also proposed. The first one is the feature extraction module. The most complicated computations in the module are the division and nth-root operations. We utilize the CORDIC method to devise a divider. For the nth-root operation, a specific circuit is designed in accordance with the Brahmagupta iteration algorithm. For the Viterbi algorithm, a dedicated hardware architecture is also presented. This architecture is designed based on the 4-step fully Viterbi algorithm. This speed-up of this module is also ascribed to the fully pipeline systolic array architecture. Jhing-Fa Wang 王駿發 2003 學位論文 ; thesis 57 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 電機工程學系碩博士班 === 91 === In this thesis, an environmental sound recognition system based on MPEG-7 features (centroid, spread, and flatness [1]) and its corresponding VLSI architectures are proposed. Traditional sound recognizer utilizes decision-tree based method and causes a problem where the parameter is not generalized [2~5]. The HMM based sound recognizer has been introduced by [8] to resolve this drawback. However, it adopts spectrum parameter and will result in high dimensional feature vectors. This thesis successfully solves the shortcoming by taking the basis extraction. The recognition rate is about 82% while only spectrogram is adopted as the parameter. The improved recognition rate is about 95% while above three mentioned MPEG-7 audio features are regarded as the parameters in our environmental sound recognizer.
Moreover, related VLSI architectures for this sound recognition system are also proposed. The first one is the feature extraction module. The most complicated computations in the module are the division and nth-root operations. We utilize the CORDIC method to devise a divider. For the nth-root operation, a specific circuit is designed in accordance with the Brahmagupta iteration algorithm. For the Viterbi algorithm, a dedicated hardware architecture is also presented. This architecture is designed based on the 4-step fully Viterbi algorithm. This speed-up of this module is also ascribed to the fully pipeline systolic array architecture.
|
author2 |
Jhing-Fa Wang |
author_facet |
Jhing-Fa Wang Tze-Hsuan Huang 黃子軒 |
author |
Tze-Hsuan Huang 黃子軒 |
spellingShingle |
Tze-Hsuan Huang 黃子軒 VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
author_sort |
Tze-Hsuan Huang |
title |
VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
title_short |
VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
title_full |
VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
title_fullStr |
VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
title_full_unstemmed |
VLSI Architectures for Home EnvironmentalSound Recognition Based on MPEG-7 Features |
title_sort |
vlsi architectures for home environmentalsound recognition based on mpeg-7 features |
publishDate |
2003 |
url |
http://ndltd.ncl.edu.tw/handle/50906995482258528290 |
work_keys_str_mv |
AT tzehsuanhuang vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features AT huángzixuān vlsiarchitecturesforhomeenvironmentalsoundrecognitionbasedonmpeg7features AT tzehsuanhuang yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì AT huángzixuān yǐmpeg7tèzhēngwèijīchǔdejūjiāhuánjìngshēngyīnbiànshíqìzhīchāodàxíngjītǐdiànlùjiàgòushèjì |
_version_ |
1718314394335576064 |