LAUN Improved StarGAN for Facial Emotion Recognition
In the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9186113/ |
id |
doaj-f3bf12c26bf64243bcdff478d3ac5077 |
---|---|
record_format |
Article |
spelling |
doaj-f3bf12c26bf64243bcdff478d3ac50772021-03-30T03:23:06ZengIEEEIEEE Access2169-35362020-01-01816150916151810.1109/ACCESS.2020.30215319186113LAUN Improved StarGAN for Facial Emotion RecognitionXiaohua Wang0https://orcid.org/0000-0003-1751-2291Jianqiao Gong1https://orcid.org/0000-0003-4362-5804Min Hu2https://orcid.org/0000-0003-2122-0240Yu Gu3https://orcid.org/0000-0001-6939-0850Fuji Ren4https://orcid.org/0000-0003-4860-9184Key Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaKey Laboratory of Knowledge Engineering with Big Data, Ministry of Education, Hefei University of Technology, Hefei, ChinaAnhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, ChinaIn the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to-one faces with different expressions, which can be used to enhance databases. StarGAN can perform one-to-many translations for multiple expressions. Compared with original GANs, StarGAN can increase the efficiency of sample generation. Nevertheless, there are some defects in essential areas of the generated face, such as the mouth and the fuzzy side face image generation. To address these limitations, we improved StarGAN to alleviate the defects of images generation by modifying the reconstruction loss and adding the Contextual loss. Meanwhile, we added the Attention U-Net to StarGAN's generator, replacing StarGAN's original generator. Therefore, we proposed the Contextual loss and Attention U-Net (LAUN) improved StarGAN. The U-shape structure and skip connection in Attention U-Net can effectively integrate the details and semantic features of images. The network's attention structure can pay attention to the essential areas of the human face. The experimental results demonstrate that the improved model can alleviate some flaws in the face generated by the original StarGAN. Therefore, it can generate person images with better quality with different poses and expressions. The experiments were conducted on the Karolinska Directed Emotional Faces database, and the accuracy of facial expression recognition is 95.97%, 2.19% higher than that by using StarGAN. Meanwhile, the experiments were carried out on the MMI Facial Expression Database, and the accuracy of expression is 98.30%, 1.21% higher than that by using StarGAN. Moreover, experiment results have better performance based on the LAUN improved StarGAN enhanced databases than those without enhancement.https://ieeexplore.ieee.org/document/9186113/Facial expression recognitiondata enhancementgenerative adversarial networksself-attention |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaohua Wang Jianqiao Gong Min Hu Yu Gu Fuji Ren |
spellingShingle |
Xiaohua Wang Jianqiao Gong Min Hu Yu Gu Fuji Ren LAUN Improved StarGAN for Facial Emotion Recognition IEEE Access Facial expression recognition data enhancement generative adversarial networks self-attention |
author_facet |
Xiaohua Wang Jianqiao Gong Min Hu Yu Gu Fuji Ren |
author_sort |
Xiaohua Wang |
title |
LAUN Improved StarGAN for Facial Emotion Recognition |
title_short |
LAUN Improved StarGAN for Facial Emotion Recognition |
title_full |
LAUN Improved StarGAN for Facial Emotion Recognition |
title_fullStr |
LAUN Improved StarGAN for Facial Emotion Recognition |
title_full_unstemmed |
LAUN Improved StarGAN for Facial Emotion Recognition |
title_sort |
laun improved stargan for facial emotion recognition |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In the field of facial expression recognition, deep learning is extensively used. However, insufficient and unbalanced facial training data in available public databases is a major challenge for improving the expression recognition rate. Generative Adversarial Networks (GANs) can produce more one-to-one faces with different expressions, which can be used to enhance databases. StarGAN can perform one-to-many translations for multiple expressions. Compared with original GANs, StarGAN can increase the efficiency of sample generation. Nevertheless, there are some defects in essential areas of the generated face, such as the mouth and the fuzzy side face image generation. To address these limitations, we improved StarGAN to alleviate the defects of images generation by modifying the reconstruction loss and adding the Contextual loss. Meanwhile, we added the Attention U-Net to StarGAN's generator, replacing StarGAN's original generator. Therefore, we proposed the Contextual loss and Attention U-Net (LAUN) improved StarGAN. The U-shape structure and skip connection in Attention U-Net can effectively integrate the details and semantic features of images. The network's attention structure can pay attention to the essential areas of the human face. The experimental results demonstrate that the improved model can alleviate some flaws in the face generated by the original StarGAN. Therefore, it can generate person images with better quality with different poses and expressions. The experiments were conducted on the Karolinska Directed Emotional Faces database, and the accuracy of facial expression recognition is 95.97%, 2.19% higher than that by using StarGAN. Meanwhile, the experiments were carried out on the MMI Facial Expression Database, and the accuracy of expression is 98.30%, 1.21% higher than that by using StarGAN. Moreover, experiment results have better performance based on the LAUN improved StarGAN enhanced databases than those without enhancement. |
topic |
Facial expression recognition data enhancement generative adversarial networks self-attention |
url |
https://ieeexplore.ieee.org/document/9186113/ |
work_keys_str_mv |
AT xiaohuawang launimprovedstarganforfacialemotionrecognition AT jianqiaogong launimprovedstarganforfacialemotionrecognition AT minhu launimprovedstarganforfacialemotionrecognition AT yugu launimprovedstarganforfacialemotionrecognition AT fujiren launimprovedstarganforfacialemotionrecognition |
_version_ |
1724183589078171648 |