LAUN Improved StarGAN for Facial Emotion Recognition

In the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to...

Full description

Bibliographic Details
Main Authors: Xiaohua Wang, Jianqiao Gong, Min Hu, Yu Gu, Fuji Ren
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9186113/
id doaj-f3bf12c26bf64243bcdff478d3ac5077
record_format Article
spelling doaj-f3bf12c26bf64243bcdff478d3ac50772021-03-30T03:23:06ZengIEEEIEEE Access2169-35362020-01-01816150916151810.1109/ACCESS.2020.30215319186113LAUN Improved StarGAN for Facial Emotion RecognitionXiaohua Wang0https://orcid.org/0000-0003-1751-2291Jianqiao Gong1https://orcid.org/0000-0003-4362-5804Min Hu2https://orcid.org/0000-0003-2122-0240Yu Gu3https://orcid.org/0000-0001-6939-0850Fuji Ren4https://orcid.org/0000-0003-4860-9184Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaAnhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, ChinaIn the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to-one faces with different expressions, which can be used to enhance databases. StarGAN can perform one-to-many translations for multiple expressions. Compared with original GANs, StarGAN can increase the efficiency of sample generation. Nevertheless, there are some defects in essential areas of the generated face, such as the mouth and the fuzzy side face image generation. To address these limitations, we improved StarGAN to alleviate the defects of images generation by modifying the reconstruction loss and adding the Contextual loss. Meanwhile, we added the Attention U-Net to StarGAN's generator, replacing StarGAN's original generator. Therefore, we proposed the Contextual loss and Attention U-Net (LAUN) improved StarGAN. The U-shape structure and skip connection in Attention U-Net can effectively integrate the details and semantic features of images. The network's attention structure can pay attention to the essential areas of the human face. The experimental results demonstrate that the improved model can alleviate some flaws in the face generated by the original StarGAN. Therefore, it can generate person images with better quality with different poses and expressions. The experiments were conducted on the Karolinska Directed Emotional Faces database, and the accuracy of facial expression recognition is 95.97%, 2.19% higher than that by using StarGAN. Meanwhile, the experiments were carried out on the MMI Facial Expression Database, and the accuracy of expression is 98.30%, 1.21% higher than that by using StarGAN. Moreover, experiment results have better performance based on the LAUN improved StarGAN enhanced databases than those without enhancement.https://ieeexplore.ieee.org/document/9186113/Facial expression recognitiondata enhancementgenerative adversarial networksself-attention
collection DOAJ
language English
format Article
sources DOAJ
author Xiaohua Wang
Jianqiao Gong
Min Hu
Yu Gu
Fuji Ren
spellingShingle Xiaohua Wang
Jianqiao Gong
Min Hu
Yu Gu
Fuji Ren
LAUN Improved StarGAN for Facial Emotion Recognition
IEEE Access
Facial expression recognition
data enhancement
generative adversarial networks
self-attention
author_facet Xiaohua Wang
Jianqiao Gong
Min Hu
Yu Gu
Fuji Ren
author_sort Xiaohua Wang
title LAUN Improved StarGAN for Facial Emotion Recognition
title_short LAUN Improved StarGAN for Facial Emotion Recognition
title_full LAUN Improved StarGAN for Facial Emotion Recognition
title_fullStr LAUN Improved StarGAN for Facial Emotion Recognition
title_full_unstemmed LAUN Improved StarGAN for Facial Emotion Recognition
title_sort laun improved stargan for facial emotion recognition
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to-one faces with different expressions, which can be used to enhance databases. StarGAN can perform one-to-many translations for multiple expressions. Compared with original GANs, StarGAN can increase the efficiency of sample generation. Nevertheless, there are some defects in essential areas of the generated face, such as the mouth and the fuzzy side face image generation. To address these limitations, we improved StarGAN to alleviate the defects of images generation by modifying the reconstruction loss and adding the Contextual loss. Meanwhile, we added the Attention U-Net to StarGAN's generator, replacing StarGAN's original generator. Therefore, we proposed the Contextual loss and Attention U-Net (LAUN) improved StarGAN. The U-shape structure and skip connection in Attention U-Net can effectively integrate the details and semantic features of images. The network's attention structure can pay attention to the essential areas of the human face. The experimental results demonstrate that the improved model can alleviate some flaws in the face generated by the original StarGAN. Therefore, it can generate person images with better quality with different poses and expressions. The experiments were conducted on the Karolinska Directed Emotional Faces database, and the accuracy of facial expression recognition is 95.97%, 2.19% higher than that by using StarGAN. Meanwhile, the experiments were carried out on the MMI Facial Expression Database, and the accuracy of expression is 98.30%, 1.21% higher than that by using StarGAN. Moreover, experiment results have better performance based on the LAUN improved StarGAN enhanced databases than those without enhancement.
topic Facial expression recognition
data enhancement
generative adversarial networks
self-attention
url https://ieeexplore.ieee.org/document/9186113/
work_keys_str_mv AT xiaohuawang launimprovedstarganforfacialemotionrecognition
AT jianqiaogong launimprovedstarganforfacialemotionrecognition
AT minhu launimprovedstarganforfacialemotionrecognition
AT yugu launimprovedstarganforfacialemotionrecognition
AT fujiren launimprovedstarganforfacialemotionrecognition
_version_ 1724183589078171648