Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it i...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-01-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/21/3/833 |
id |
doaj-b4924f9d95334c37a10508faec3d9e8e |
---|---|
record_format |
Article |
spelling |
doaj-b4924f9d95334c37a10508faec3d9e8e2021-01-28T00:00:59ZengMDPI AGSensors1424-82202021-01-012183383310.3390/s21030833Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression RecognitionXingcan Liang0Linsen Xu1Jinfu Liu2Zhipeng Liu3Gaoxin Cheng4Jiajun Xu5Lei Liu6Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaRecognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods.https://www.mdpi.com/1424-8220/21/3/833facial expression recognitionpatch attentionshallow featurefeature extractionfacial representationconvolutional layer |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu |
spellingShingle |
Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition Sensors facial expression recognition patch attention shallow feature feature extraction facial representation convolutional layer |
author_facet |
Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu |
author_sort |
Xingcan Liang |
title |
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition |
title_short |
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition |
title_full |
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition |
title_fullStr |
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition |
title_full_unstemmed |
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition |
title_sort |
patch attention layer of embedding handcrafted features in cnn for facial expression recognition |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2021-01-01 |
description |
Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods. |
topic |
facial expression recognition patch attention shallow feature feature extraction facial representation convolutional layer |
url |
https://www.mdpi.com/1424-8220/21/3/833 |
work_keys_str_mv |
AT xingcanliang patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT linsenxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT jinfuliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT zhipengliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT gaoxincheng patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT jiajunxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT leiliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition |
_version_ |
1724320369872994304 |