An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation
Image-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relievin...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9019692/ |
id |
doaj-3d216a7ffd0242299f9e6569ca56ba9e |
---|---|
record_format |
Article |
spelling |
doaj-3d216a7ffd0242299f9e6569ca56ba9e2021-03-30T02:09:19ZengIEEEIEEE Access2169-35362020-01-018424584246810.1109/ACCESS.2020.29773469019692An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose EstimationJing Li0https://orcid.org/0000-0002-4464-1008Jiang Wang1https://orcid.org/0000-0003-0012-1729Farhan Ullah2https://orcid.org/0000-0003-2422-575XCollege of Computer Science, Sichuan University, Chengdu, ChinaCollege of Computer Science, Sichuan University, Chengdu, ChinaCollege of Computer Science, Sichuan University, Chengdu, ChinaImage-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relieving problem complexity. Our method integrates the deep Task-Simplification oriented Image Regularization (TSIR) module with the Anchor-Guided Pose Estimation (AGPE) module, and formulate the HPE problem into a unified end-to-end learning framework. In this paper, we define anchors as images that strictly obey the “gravity rule in camera”, which follows the assumption that camera coordinate of the vertical axis should always be consistent with that of the local head coordinate. We formulate image pair as the regularized image produced by TSIR along with its anchor counterpart, both of which are fed into the AGPE module for estimating fine-grained head poses. This paper also proposes an Anchor-Guided Pairwise Loss (AGPL), which describes the interdependent relevance of poses between each pair of images. The proposed method is evaluated and validated with sufficient experiments which show its effectiveness. Comprehensive experiments show that our approach outperforms the state-of-the-art image-based methods on both indoor and outdoor datasets.https://ieeexplore.ieee.org/document/9019692/Head pose estimationtask-simplification oriented image regularizationanchor-guided pose estimationanchor-guided pairwise lossdeep learning framework |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jing Li Jiang Wang Farhan Ullah |
spellingShingle |
Jing Li Jiang Wang Farhan Ullah An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation IEEE Access Head pose estimation task-simplification oriented image regularization anchor-guided pose estimation anchor-guided pairwise loss deep learning framework |
author_facet |
Jing Li Jiang Wang Farhan Ullah |
author_sort |
Jing Li |
title |
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation |
title_short |
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation |
title_full |
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation |
title_fullStr |
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation |
title_full_unstemmed |
An End-to-End Task-Simplified and Anchor-Guided Deep Learning Framework for Image-Based Head Pose Estimation |
title_sort |
end-to-end task-simplified and anchor-guided deep learning framework for image-based head pose estimation |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Image-based Head Pose Estimation (HPE) from an arbitrary view is still challenging due to the complex imaging conditions as well as the intrinsic and extrinsic property of the faces. Different from existing HPE methods combining additional cues or tasks, this paper solves the HPE problem by relieving problem complexity. Our method integrates the deep Task-Simplification oriented Image Regularization (TSIR) module with the Anchor-Guided Pose Estimation (AGPE) module, and formulate the HPE problem into a unified end-to-end learning framework. In this paper, we define anchors as images that strictly obey the “gravity rule in camera”, which follows the assumption that camera coordinate of the vertical axis should always be consistent with that of the local head coordinate. We formulate image pair as the regularized image produced by TSIR along with its anchor counterpart, both of which are fed into the AGPE module for estimating fine-grained head poses. This paper also proposes an Anchor-Guided Pairwise Loss (AGPL), which describes the interdependent relevance of poses between each pair of images. The proposed method is evaluated and validated with sufficient experiments which show its effectiveness. Comprehensive experiments show that our approach outperforms the state-of-the-art image-based methods on both indoor and outdoor datasets. |
topic |
Head pose estimation task-simplification oriented image regularization anchor-guided pose estimation anchor-guided pairwise loss deep learning framework |
url |
https://ieeexplore.ieee.org/document/9019692/ |
work_keys_str_mv |
AT jingli anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation AT jiangwang anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation AT farhanullah anendtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation AT jingli endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation AT jiangwang endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation AT farhanullah endtoendtasksimplifiedandanchorguideddeeplearningframeworkforimagebasedheadposeestimation |
_version_ |
1724185720981028864 |