Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose

The performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial...

Full description

Bibliographic Details
Main Authors: Koichiro Niinuma, Itir Onal Ertugrul, Jeffrey F. Cohn, László A. Jeni
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-04-01
Series:Frontiers in Computer Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/full
id doaj-a39b3234119d4381b1b9b0122e0069f6
record_format Article
spelling doaj-a39b3234119d4381b1b9b0122e0069f62021-04-29T08:37:30ZengFrontiers Media S.A.Frontiers in Computer Science2624-98982021-04-01310.3389/fcomp.2021.636094636094Systematic Evaluation of Design Choices for Deep Facial Action Coding Across PoseKoichiro Niinuma0Itir Onal Ertugrul1Jeffrey F. Cohn2László A. Jeni3Fujitsu Laboratories of America, Pittsburgh, PA, United StatesDepartment of Cognitive Science and Artificial Intelligence, Tilburg University, Tilburg, NetherlandsDepartment of Psychology, University of Pittsburgh, Pittsburgh, PA, United StatesRobotics Institute, Carnegie Mellon University, Pittsburgh, PA, United StatesThe performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial action unit occurrence and intensity across pose. Using the The Facial Expression Recognition and Analysis 2017 (FERA 2017) database, which provides a common protocol to evaluate robustness to pose variation, we systematically evaluated design choices in pre-training, feature alignment, model size selection, and optimizer details. Informed by the findings, we developed an architecture that exceeds state-of-the-art on FERA 2017. The architecture achieved a 3.5% increase in F1 score for occurrence detection and a 5.8% increase in Intraclass Correlation (ICC) for intensity estimation. To evaluate the generalizability of the architecture to unseen poses and new dataset domains, we performed experiments across pose in FERA 2017 and across domains in Denver Intensity of Spontaneous Facial Action (DISFA) and the UNBC Pain Archive.https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/fullaction unitfacial expression codingdesign choice in deep learningAU intensity estimationAU occurrence detectioncross-pose evaluation
collection DOAJ
language English
format Article
sources DOAJ
author Koichiro Niinuma
Itir Onal Ertugrul
Jeffrey F. Cohn
László A. Jeni
spellingShingle Koichiro Niinuma
Itir Onal Ertugrul
Jeffrey F. Cohn
László A. Jeni
Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
Frontiers in Computer Science
action unit
facial expression coding
design choice in deep learning
AU intensity estimation
AU occurrence detection
cross-pose evaluation
author_facet Koichiro Niinuma
Itir Onal Ertugrul
Jeffrey F. Cohn
László A. Jeni
author_sort Koichiro Niinuma
title Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
title_short Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
title_full Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
title_fullStr Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
title_full_unstemmed Systematic Evaluation of Design Choices for Deep Facial Action Coding Across Pose
title_sort systematic evaluation of design choices for deep facial action coding across pose
publisher Frontiers Media S.A.
series Frontiers in Computer Science
issn 2624-9898
publishDate 2021-04-01
description The performance of automated facial expression coding is improving steadily. Advances in deep learning techniques have been key to this success. While the advantage of modern deep learning techniques is clear, the contribution of critical design choices remains largely unknown, especially for facial action unit occurrence and intensity across pose. Using the The Facial Expression Recognition and Analysis 2017 (FERA 2017) database, which provides a common protocol to evaluate robustness to pose variation, we systematically evaluated design choices in pre-training, feature alignment, model size selection, and optimizer details. Informed by the findings, we developed an architecture that exceeds state-of-the-art on FERA 2017. The architecture achieved a 3.5% increase in F1 score for occurrence detection and a 5.8% increase in Intraclass Correlation (ICC) for intensity estimation. To evaluate the generalizability of the architecture to unseen poses and new dataset domains, we performed experiments across pose in FERA 2017 and across domains in Denver Intensity of Spontaneous Facial Action (DISFA) and the UNBC Pain Archive.
topic action unit
facial expression coding
design choice in deep learning
AU intensity estimation
AU occurrence detection
cross-pose evaluation
url https://www.frontiersin.org/articles/10.3389/fcomp.2021.636094/full
work_keys_str_mv AT koichironiinuma systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose
AT itironalertugrul systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose
AT jeffreyfcohn systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose
AT laszloajeni systematicevaluationofdesignchoicesfordeepfacialactioncodingacrosspose
_version_ 1721501418511138816