An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation

Image-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relievin...

Full description

Bibliographic Details
Main Authors: Jing Li, Jiang Wang, Farhan Ullah
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9019692/
id doaj-3d216a7ffd0242299f9e6569ca56ba9e
record_format Article
spelling doaj-3d216a7ffd0242299f9e6569ca56ba9e2021-03-30T02:09:19ZengIEEEIEEE Access2169-35362020-01-018424584246810.1109/ACCESS.2020.29773469019692An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose EstimationJing Li0https://orcid.org/0000-0002-4464-1008Jiang Wang1https://orcid.org/0000-0003-0012-1729Farhan Ullah2https://orcid.org/0000-0003-2422-575XCollege of Computer Science, Sichuan University, Chengdu, ChinaCollege of Computer Science, Sichuan University, Chengdu, ChinaCollege of Computer Science, Sichuan University, Chengdu, ChinaImage-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relieving problem complexity. Our method integrates the deep Task-Simplification oriented Image Regularization (TSIR) module with the Anchor-Guided Pose Estimation (AGPE) module, and formulate the HPE problem into a unified end-to-end learning framework. In this paper, we define anchors as images that strictly obey the “gravity rule in camera”, which follows the assumption that camera coordinate of the vertical axis should always be consistent with that of the local head coordinate. We formulate image pair as the regularized image produced by TSIR along with its anchor counterpart, both of which are fed into the AGPE module for estimating fine-grained head poses. This paper also proposes an Anchor-Guided Pairwise Loss (AGPL), which describes the interdependent relevance of poses between each pair of images. The proposed method is evaluated and validated with sufficient experiments which show its effectiveness. Comprehensive experiments show that our approach outperforms the state-of-the-art image-based methods on both indoor and outdoor datasets.https://ieeexplore.ieee.org/document/9019692/Head pose estimationtask-simplification oriented image regularizationanchor-guided pose estimationanchor-guided pairwise lossdeep learning framework
collection DOAJ
language English
format Article
sources DOAJ
author Jing Li
Jiang Wang
Farhan Ullah
spellingShingle Jing Li
Jiang Wang
Farhan Ullah
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
IEEE Access
Head pose estimation
task-simplification oriented image regularization
anchor-guided pose estimation
anchor-guided pairwise loss
deep learning framework
author_facet Jing Li
Jiang Wang
Farhan Ullah
author_sort Jing Li
title An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
title_short An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
title_full An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
title_fullStr An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
title_full_unstemmed An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
title_sort end-to-end task-simplified and anchor-guided deep learning framework for image-based head pose estimation
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Image-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relieving problem complexity. Our method integrates the deep Task-Simplification oriented Image Regularization (TSIR) module with the Anchor-Guided Pose Estimation (AGPE) module, and formulate the HPE problem into a unified end-to-end learning framework. In this paper, we define anchors as images that strictly obey the “gravity rule in camera”, which follows the assumption that camera coordinate of the vertical axis should always be consistent with that of the local head coordinate. We formulate image pair as the regularized image produced by TSIR along with its anchor counterpart, both of which are fed into the AGPE module for estimating fine-grained head poses. This paper also proposes an Anchor-Guided Pairwise Loss (AGPL), which describes the interdependent relevance of poses between each pair of images. The proposed method is evaluated and validated with sufficient experiments which show its effectiveness. Comprehensive experiments show that our approach outperforms the state-of-the-art image-based methods on both indoor and outdoor datasets.
topic Head pose estimation
task-simplification oriented image regularization
anchor-guided pose estimation
anchor-guided pairwise loss
deep learning framework
url https://ieeexplore.ieee.org/document/9019692/
work_keys_str_mv AT jingli anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
AT jiangwang anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
AT farhanullah anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
AT jingli endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
AT jiangwang endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
AT farhanullah endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation
_version_ 1724185720981028864