Wavelet based feature combination for recognition of emotions
In this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency v...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2018-12-01
|
Series: | Ain Shams Engineering Journal |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2090447916301514 |
id |
doaj-4afb732ce6cf4df6b0c9500bcdae79c2 |
---|---|
record_format |
Article |
spelling |
doaj-4afb732ce6cf4df6b0c9500bcdae79c22021-06-02T13:59:19ZengElsevierAin Shams Engineering Journal2090-44792018-12-019417991806Wavelet based feature combination for recognition of emotionsHemanta Kumar Palo0Mihir Narayan Mohanty1Department of Electronics and Communication Engineering, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, IndiaCorresponding author.; Department of Electronics and Communication Engineering, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, IndiaIn this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency variation with respect to time has been evaluated properly using wavelet coefficients. Again, MFCC and LPCC features are derived from wavelets, so that the exact information of the emotion has been fetched and are used as features. The features are reduced with vector quantization method and used in radial basis function network (RBFNN) classifier. The feature sets are also combined and tested in the classifier. This piece of work deals with five emotions as angry, fear, happy, disgust and neutral. These are tested for Berlin (EMO-DB) and Surrey Audio-Visual Expressed Emotion (SAVEE) database. The proposed frequency based decomposition and combination choice of features show excellent result and it is exhibited in result section. Keywords: Mel-frequency cepstral coefficient, Linear prediction cepstral coefficient, Wavelet, Feature combination, Classificationhttp://www.sciencedirect.com/science/article/pii/S2090447916301514 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Hemanta Kumar Palo Mihir Narayan Mohanty |
spellingShingle |
Hemanta Kumar Palo Mihir Narayan Mohanty Wavelet based feature combination for recognition of emotions Ain Shams Engineering Journal |
author_facet |
Hemanta Kumar Palo Mihir Narayan Mohanty |
author_sort |
Hemanta Kumar Palo |
title |
Wavelet based feature combination for recognition of emotions |
title_short |
Wavelet based feature combination for recognition of emotions |
title_full |
Wavelet based feature combination for recognition of emotions |
title_fullStr |
Wavelet based feature combination for recognition of emotions |
title_full_unstemmed |
Wavelet based feature combination for recognition of emotions |
title_sort |
wavelet based feature combination for recognition of emotions |
publisher |
Elsevier |
series |
Ain Shams Engineering Journal |
issn |
2090-4479 |
publishDate |
2018-12-01 |
description |
In this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency variation with respect to time has been evaluated properly using wavelet coefficients. Again, MFCC and LPCC features are derived from wavelets, so that the exact information of the emotion has been fetched and are used as features. The features are reduced with vector quantization method and used in radial basis function network (RBFNN) classifier. The feature sets are also combined and tested in the classifier. This piece of work deals with five emotions as angry, fear, happy, disgust and neutral. These are tested for Berlin (EMO-DB) and Surrey Audio-Visual Expressed Emotion (SAVEE) database. The proposed frequency based decomposition and combination choice of features show excellent result and it is exhibited in result section. Keywords: Mel-frequency cepstral coefficient, Linear prediction cepstral coefficient, Wavelet, Feature combination, Classification |
url |
http://www.sciencedirect.com/science/article/pii/S2090447916301514 |
work_keys_str_mv |
AT hemantakumarpalo waveletbasedfeaturecombinationforrecognitionofemotions AT mihirnarayanmohanty waveletbasedfeaturecombinationforrecognitionofemotions |
_version_ |
1721403831319789568 |