Wavelet based feature combination for recognition of emotions

In this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency v...

Full description

Bibliographic Details
Main Authors: Hemanta Kumar Palo, Mihir Narayan Mohanty
Format: Article
Language:English
Published: Elsevier 2018-12-01
Series:Ain Shams Engineering Journal
Online Access:http://www.sciencedirect.com/science/article/pii/S2090447916301514
id doaj-4afb732ce6cf4df6b0c9500bcdae79c2
record_format Article
spelling doaj-4afb732ce6cf4df6b0c9500bcdae79c22021-06-02T13:59:19ZengElsevierAin Shams Engineering Journal2090-44792018-12-019417991806Wavelet based feature combination for recognition of emotionsHemanta Kumar Palo0Mihir Narayan Mohanty1Department of Electronics and Communication Engineering, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, IndiaCorresponding author.; Department of Electronics and Communication Engineering, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, IndiaIn this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency variation with respect to time has been evaluated properly using wavelet coefficients. Again, MFCC and LPCC features are derived from wavelets, so that the exact information of the emotion has been fetched and are used as features. The features are reduced with vector quantization method and used in radial basis function network (RBFNN) classifier. The feature sets are also combined and tested in the classifier. This piece of work deals with five emotions as angry, fear, happy, disgust and neutral. These are tested for Berlin (EMO-DB) and Surrey Audio-Visual Expressed Emotion (SAVEE) database. The proposed frequency based decomposition and combination choice of features show excellent result and it is exhibited in result section. Keywords: Mel-frequency cepstral coefficient, Linear prediction cepstral coefficient, Wavelet, Feature combination, Classificationhttp://www.sciencedirect.com/science/article/pii/S2090447916301514
collection DOAJ
language English
format Article
sources DOAJ
author Hemanta Kumar Palo
Mihir Narayan Mohanty
spellingShingle Hemanta Kumar Palo
Mihir Narayan Mohanty
Wavelet based feature combination for recognition of emotions
Ain Shams Engineering Journal
author_facet Hemanta Kumar Palo
Mihir Narayan Mohanty
author_sort Hemanta Kumar Palo
title Wavelet based feature combination for recognition of emotions
title_short Wavelet based feature combination for recognition of emotions
title_full Wavelet based feature combination for recognition of emotions
title_fullStr Wavelet based feature combination for recognition of emotions
title_full_unstemmed Wavelet based feature combination for recognition of emotions
title_sort wavelet based feature combination for recognition of emotions
publisher Elsevier
series Ain Shams Engineering Journal
issn 2090-4479
publishDate 2018-12-01
description In this paper, authors tried to develop reduced combinational features for emotional speech recognition. The spectral/cepstral features like wavelet coefficient, LPCC (linear prediction cepstral coefficient) and MFCC (mel-frequency cepstral coefficient) are used as baseline features. The frequency variation with respect to time has been evaluated properly using wavelet coefficients. Again, MFCC and LPCC features are derived from wavelets, so that the exact information of the emotion has been fetched and are used as features. The features are reduced with vector quantization method and used in radial basis function network (RBFNN) classifier. The feature sets are also combined and tested in the classifier. This piece of work deals with five emotions as angry, fear, happy, disgust and neutral. These are tested for Berlin (EMO-DB) and Surrey Audio-Visual Expressed Emotion (SAVEE) database. The proposed frequency based decomposition and combination choice of features show excellent result and it is exhibited in result section. Keywords: Mel-frequency cepstral coefficient, Linear prediction cepstral coefficient, Wavelet, Feature combination, Classification
url http://www.sciencedirect.com/science/article/pii/S2090447916301514
work_keys_str_mv AT hemantakumarpalo waveletbasedfeaturecombinationforrecognitionofemotions
AT mihirnarayanmohanty waveletbasedfeaturecombinationforrecognitionofemotions
_version_ 1721403831319789568