Learning Salient Segments for Speech Emotion Recognition Using Attentive Temporal Pooling

In the temporal process of expressing emotions, some intervals embed more salient emotion information than others. In this paper, by introducing an attentive temporal pooling module into the deep neural network (DNN) architecture, we present a simple but effective speech emotion recognition (SER...

Bibliographic Details
Main Authors: Xiaohan Xia, Dongmei Jiang, Hichem Sahli
Format: Article
Language: English
Published: IEEE 2020-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/9160979/