Context-Aware Attention Network for Human Emotion Recognition in Video

Recognition of human emotion from facial expression is affected by distortions of pictorial quality and facial pose, which is often ignored by traditional video emotion recognition methods. On the other hand, context information can also provide different degrees of extra clues, which can further im...

Full description

Bibliographic Details
Main Authors: Xiaodong Liu, Miao Wang
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Advances in Multimedia
Online Access:http://dx.doi.org/10.1155/2020/8843413
id doaj-10d09fea222a42808224d436d5199d54
record_format Article
spelling doaj-10d09fea222a42808224d436d5199d542020-11-25T04:11:59ZengHindawi LimitedAdvances in Multimedia1687-56801687-56992020-01-01202010.1155/2020/88434138843413Context-Aware Attention Network for Human Emotion Recognition in VideoXiaodong Liu0Miao Wang1School of Computing Henan University of Engineering, Zhengzhou, ChinaSchool of Computing Henan University of Engineering, Zhengzhou, ChinaRecognition of human emotion from facial expression is affected by distortions of pictorial quality and facial pose, which is often ignored by traditional video emotion recognition methods. On the other hand, context information can also provide different degrees of extra clues, which can further improve the recognition accuracy. In this paper, we first build a video dataset with seven categories of human emotion, named human emotion in the video (HEIV). With the HEIV dataset, we trained a context-aware attention network (CAAN) to recognize human emotion. The network consists of two subnetworks to process both face and context information. Features from facial expression and context clues are fused to represent the emotion of video frames, which will be then passed through an attention network and generate emotion scores. Then, the emotion features of all frames will be aggregated according to their emotional score. Experimental results show that our proposed method is effective on HEIV dataset.http://dx.doi.org/10.1155/2020/8843413
collection DOAJ
language English
format Article
sources DOAJ
author Xiaodong Liu
Miao Wang
spellingShingle Xiaodong Liu
Miao Wang
Context-Aware Attention Network for Human Emotion Recognition in Video
Advances in Multimedia
author_facet Xiaodong Liu
Miao Wang
author_sort Xiaodong Liu
title Context-Aware Attention Network for Human Emotion Recognition in Video
title_short Context-Aware Attention Network for Human Emotion Recognition in Video
title_full Context-Aware Attention Network for Human Emotion Recognition in Video
title_fullStr Context-Aware Attention Network for Human Emotion Recognition in Video
title_full_unstemmed Context-Aware Attention Network for Human Emotion Recognition in Video
title_sort context-aware attention network for human emotion recognition in video
publisher Hindawi Limited
series Advances in Multimedia
issn 1687-5680
1687-5699
publishDate 2020-01-01
description Recognition of human emotion from facial expression is affected by distortions of pictorial quality and facial pose, which is often ignored by traditional video emotion recognition methods. On the other hand, context information can also provide different degrees of extra clues, which can further improve the recognition accuracy. In this paper, we first build a video dataset with seven categories of human emotion, named human emotion in the video (HEIV). With the HEIV dataset, we trained a context-aware attention network (CAAN) to recognize human emotion. The network consists of two subnetworks to process both face and context information. Features from facial expression and context clues are fused to represent the emotion of video frames, which will be then passed through an attention network and generate emotion scores. Then, the emotion features of all frames will be aggregated according to their emotional score. Experimental results show that our proposed method is effective on HEIV dataset.
url http://dx.doi.org/10.1155/2020/8843413
work_keys_str_mv AT xiaodongliu contextawareattentionnetworkforhumanemotionrecognitioninvideo
AT miaowang contextawareattentionnetworkforhumanemotionrecognitioninvideo
_version_ 1715033075319046144