Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
The performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-04-01
|
Series: | Frontiers in Computer Science |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/full |
id |
doaj-a39b3234119d4381b1b9b0122e0069f6 |
---|---|
record_format |
Article |
spelling |
doaj-a39b3234119d4381b1b9b0122e0069f62021-04-29T08:37:30ZengFrontiers Media S.A.Frontiers in Computer Science2624-98982021-04-01310.3389/fcomp.2021.636094636094Systematic Evaluation of Design Choices for Deep Facial Action Coding Across PoseKoichiro Niinuma0Itir Onal Ertugrul1Jeffrey F. Cohn2László A. Jeni3Fujitsu Laboratories of America, Pittsburgh, PA, United StatesDepartment of Cognitive Science and Artificial Intelligence, Tilburg University, Tilburg, NetherlandsDepartment of Psychology, University of Pittsburgh, Pittsburgh, PA, United StatesRobotics Institute, Carnegie Mellon University, Pittsburgh, PA, United StatesThe performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial action unit occurrence and intensity across pose. Using the The Facial Expression Recognition and Analysis 2017 (FERA 2017) database, which provides a common protocol to evaluate robustness to pose variation, we systematically evaluated design choices in pre-training, feature alignment, model size selection, and optimizer details. Informed by the findings, we developed an architecture that exceeds state-of-the-art on FERA 2017. The architecture achieved a 3.5% increase in F1 score for occurrence detection and a 5.8% increase in Intraclass Correlation (ICC) for intensity estimation. To evaluate the generalizability of the architecture to unseen poses and new dataset domains, we performed experiments across pose in FERA 2017 and across domains in Denver Intensity of Spontaneous Facial Action (DISFA) and the UNBC Pain Archive.https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/fullaction unitfacial expression codingdesign choice in deep learningAU intensity estimationAU occurrence detectioncross-pose evaluation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Koichiro Niinuma Itir Onal Ertugrul Jeffrey F. Cohn László A. Jeni |
spellingShingle |
Koichiro Niinuma Itir Onal Ertugrul Jeffrey F. Cohn László A. Jeni Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose Frontiers in Computer Science action unit facial expression coding design choice in deep learning AU intensity estimation AU occurrence detection cross-pose evaluation |
author_facet |
Koichiro Niinuma Itir Onal Ertugrul Jeffrey F. Cohn László A. Jeni |
author_sort |
Koichiro Niinuma |
title |
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose |
title_short |
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose |
title_full |
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose |
title_fullStr |
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose |
title_full_unstemmed |
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose |
title_sort |
systematic evaluation of design choices for deep facial action coding across pose |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Computer Science |
issn |
2624-9898 |
publishDate |
2021-04-01 |
description |
The performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial action unit occurrence and intensity across pose. Using the The Facial Expression Recognition and Analysis 2017 (FERA 2017) database, which provides a common protocol to evaluate robustness to pose variation, we systematically evaluated design choices in pre-training, feature alignment, model size selection, and optimizer details. Informed by the findings, we developed an architecture that exceeds state-of-the-art on FERA 2017. The architecture achieved a 3.5% increase in F1 score for occurrence detection and a 5.8% increase in Intraclass Correlation (ICC) for intensity estimation. To evaluate the generalizability of the architecture to unseen poses and new dataset domains, we performed experiments across pose in FERA 2017 and across domains in Denver Intensity of Spontaneous Facial Action (DISFA) and the UNBC Pain Archive. |
topic |
action unit facial expression coding design choice in deep learning AU intensity estimation AU occurrence detection cross-pose evaluation |
url |
https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/full |
work_keys_str_mv |
AT koichironiinuma systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose AT itironalertugrul systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose AT jeffreyfcohn systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose AT laszloajeni systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose |
_version_ |
1721501418511138816 |