A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA

碩士 === 國立成功大學 === 電機工程學系碩博士班 === 94 ===   Different kinds of sound have different properties in our life environment, and we can make out surroundings by recognizing and understanding these properties of environmental sounds. For example, when we hear the fire alarm sound, we can judge there must be...

Full description

Bibliographic Details
Main Authors: Cai-Bei Lin, 林財貝
Other Authors: Jhing-Fa Wang
Format: Others
Language:en_US
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/87349046616914289202
id ndltd-TW-094NCKU5442131
record_format oai_dc
spelling ndltd-TW-094NCKU54421312015-12-16T04:31:52Z http://ndltd.ncl.edu.tw/handle/87349046616914289202 A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA 應用機率型SVMs與ICA於以內容為基礎音訊分類之研究 Cai-Bei Lin 林財貝 碩士 國立成功大學 電機工程學系碩博士班 94   Different kinds of sound have different properties in our life environment, and we can make out surroundings by recognizing and understanding these properties of environmental sounds. For example, when we hear the fire alarm sound, we can judge there must be fire happening. It will be a great help to us for monitoring surrounding environment if we can classify and identify in accordance with the sound information, especially for the deaf person and security system. Besides, as mentioned in the former article, large amount of information is recorded in files format of audio. Making use of audio classification will be contributive to us for searching the audio segment we want.   In this thesis, we present a home environmental audio classifier based on support vector machine (SVM) and independent component analysis. We use independent component analysis to extract the audio feature. This technique can extract independent components based on statistical characteristics. The proposed audio features can be categorized as three sets. The first feature set is perceptual features which include total spectrum power, subband powers, brightness, bandwidth and pitch. The second feature set consists of MFCC and delta MFCC. The third feature set is the ICA-transformed MFCC feature. This is achieved by transforming the MFCC feature using ICA transform. The ICA transform is literately obtained based on all the training audio data. The audio classifier is designed using probabilistic SVMs. We collect an audio database contained 649 wav files of 15 classes. Experiments demonstrate the proposed sound classifier can achieve a 97.52% classification rate. Jhing-Fa Wang 王駿發 2006 學位論文 ; thesis 51 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 電機工程學系碩博士班 === 94 ===   Different kinds of sound have different properties in our life environment, and we can make out surroundings by recognizing and understanding these properties of environmental sounds. For example, when we hear the fire alarm sound, we can judge there must be fire happening. It will be a great help to us for monitoring surrounding environment if we can classify and identify in accordance with the sound information, especially for the deaf person and security system. Besides, as mentioned in the former article, large amount of information is recorded in files format of audio. Making use of audio classification will be contributive to us for searching the audio segment we want.   In this thesis, we present a home environmental audio classifier based on support vector machine (SVM) and independent component analysis. We use independent component analysis to extract the audio feature. This technique can extract independent components based on statistical characteristics. The proposed audio features can be categorized as three sets. The first feature set is perceptual features which include total spectrum power, subband powers, brightness, bandwidth and pitch. The second feature set consists of MFCC and delta MFCC. The third feature set is the ICA-transformed MFCC feature. This is achieved by transforming the MFCC feature using ICA transform. The ICA transform is literately obtained based on all the training audio data. The audio classifier is designed using probabilistic SVMs. We collect an audio database contained 649 wav files of 15 classes. Experiments demonstrate the proposed sound classifier can achieve a 97.52% classification rate.
author2 Jhing-Fa Wang
author_facet Jhing-Fa Wang
Cai-Bei Lin
林財貝
author Cai-Bei Lin
林財貝
spellingShingle Cai-Bei Lin
林財貝
A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
author_sort Cai-Bei Lin
title A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
title_short A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
title_full A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
title_fullStr A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
title_full_unstemmed A Study on Content-based Audio Classification Using Probabilistic SVMs and ICA
title_sort study on content-based audio classification using probabilistic svms and ica
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/87349046616914289202
work_keys_str_mv AT caibeilin astudyoncontentbasedaudioclassificationusingprobabilisticsvmsandica
AT líncáibèi astudyoncontentbasedaudioclassificationusingprobabilisticsvmsandica
AT caibeilin yīngyòngjīlǜxíngsvmsyǔicayúyǐnèiróngwèijīchǔyīnxùnfēnlèizhīyánjiū
AT líncáibèi yīngyòngjīlǜxíngsvmsyǔicayúyǐnèiróngwèijīchǔyīnxùnfēnlèizhīyánjiū
AT caibeilin studyoncontentbasedaudioclassificationusingprobabilisticsvmsandica
AT líncáibèi studyoncontentbasedaudioclassificationusingprobabilisticsvmsandica
_version_ 1718149131080302592