Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition

Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it i...

Full description

Bibliographic Details
Main Authors: Xingcan Liang, Linsen Xu, Jinfu Liu, Zhipeng Liu, Gaoxin Cheng, Jiajun Xu, Lei Liu
Format: Article
Language:English
Published: MDPI AG 2021-01-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/3/833
id doaj-b4924f9d95334c37a10508faec3d9e8e
record_format Article
spelling doaj-b4924f9d95334c37a10508faec3d9e8e2021-01-28T00:00:59ZengMDPI AGSensors1424-82202021-01-012183383310.3390/s21030833Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression RecognitionXingcan Liang0Linsen Xu1Jinfu Liu2Zhipeng Liu3Gaoxin Cheng4Jiajun Xu5Lei Liu6Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaRecognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods.https://www.mdpi.com/1424-8220/21/3/833facial expression recognitionpatch attentionshallow featurefeature extractionfacial representationconvolutional layer
collection DOAJ
language English
format Article
sources DOAJ
author Xingcan Liang
Linsen Xu
Jinfu Liu
Zhipeng Liu
Gaoxin Cheng
Jiajun Xu
Lei Liu
spellingShingle Xingcan Liang
Linsen Xu
Jinfu Liu
Zhipeng Liu
Gaoxin Cheng
Jiajun Xu
Lei Liu
Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
Sensors
facial expression recognition
patch attention
shallow feature
feature extraction
facial representation
convolutional layer
author_facet Xingcan Liang
Linsen Xu
Jinfu Liu
Zhipeng Liu
Gaoxin Cheng
Jiajun Xu
Lei Liu
author_sort Xingcan Liang
title Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_short Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_full Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_fullStr Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_full_unstemmed Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_sort patch attention layer of embedding handcrafted features in cnn for facial expression recognition
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2021-01-01
description Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods.
topic facial expression recognition
patch attention
shallow feature
feature extraction
facial representation
convolutional layer
url https://www.mdpi.com/1424-8220/21/3/833
work_keys_str_mv AT xingcanliang patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT linsenxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT jinfuliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT zhipengliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT gaoxincheng patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT jiajunxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
AT leiliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
_version_ 1724320369872994304