Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition

Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it i...

Full description

Bibliographic Details
Main Authors:	Xingcan Liang, Linsen Xu, Jinfu Liu, Zhipeng Liu, Gaoxin Cheng, Jiajun Xu, Lei Liu
Format:	Article
Language:	English
Published:	MDPI AG 2021-01-01
Series:	Sensors
Subjects:	facial expression recognition patch attention shallow feature feature extraction facial representation convolutional layer
Online Access:	https://www.mdpi.com/1424-8220/21/3/833

id	doaj-b4924f9d95334c37a10508faec3d9e8e
record_format	Article
spelling	doaj-b4924f9d95334c37a10508faec3d9e8e2021-01-28T00:00:59ZengMDPI AGSensors1424-82202021-01-012183383310.3390/s21030833Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression RecognitionXingcan Liang0Linsen Xu1Jinfu Liu2Zhipeng Liu3Gaoxin Cheng4Jiajun Xu5Lei Liu6Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaHefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaUniversity of Science and Technology of China, Hefei 230026, ChinaRecognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods.https://www.mdpi.com/1424-8220/21/3/833facial expression recognitionpatch attentionshallow featurefeature extractionfacial representationconvolutional layer
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu
spellingShingle	Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition Sensors facial expression recognition patch attention shallow feature feature extraction facial representation convolutional layer
author_facet	Xingcan Liang Linsen Xu Jinfu Liu Zhipeng Liu Gaoxin Cheng Jiajun Xu Lei Liu
author_sort	Xingcan Liang
title	Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_short	Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_full	Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_fullStr	Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_full_unstemmed	Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition
title_sort	patch attention layer of embedding handcrafted features in cnn for facial expression recognition
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2021-01-01
description	Recognizing facial expression has attracted much more attention due to its broad range of applications in human–computer interaction systems. Although facial representation is crucial to final recognition accuracy, traditional handcrafted representations only reflect shallow characteristics and it is uncertain whether the convolutional layer can extract better ones. In addition, the policy that weights are shared across a whole image is improper for structured face images. To overcome such limitations, a novel method based on patches of interest, the Patch Attention Layer (PAL) of embedding handcrafted features, is proposed to learn the local shallow facial features of each patch on face images. Firstly, a handcrafted feature, Gabor surface feature (GSF), is extracted by convolving the input face image with a set of predefined Gabor filters. Secondly, the generated feature is segmented as nonoverlapped patches that can capture local shallow features by the strategy of using different local patches with different filters. Then, the weighted shallow features are fed into the remaining convolutional layers to capture high-level features. Our method can be carried out directly on a static image without facial landmark information, and the preprocessing step is very simple. Experiments on four databases show that our method achieved very competitive performance (Extended Cohn–Kanade database (CK+): 98.93%; Oulu-CASIA: 97.57%; Japanese Female Facial Expressions database (JAFFE): 93.38%; and RAF-DB: 86.8%) compared to other state-of-the-art methods.
topic	facial expression recognition patch attention shallow feature feature extraction facial representation convolutional layer
url	https://www.mdpi.com/1424-8220/21/3/833
work_keys_str_mv	AT xingcanliang patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT linsenxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT jinfuliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT zhipengliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT gaoxincheng patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT jiajunxu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition AT leiliu patchattentionlayerofembeddinghandcraftedfeaturesincnnforfacialexpressionrecognition
_version_	1724320369872994304

Patch Attention Layer of Embedding Handcrafted Features in CNN for Facial Expression Recognition

Similar Items