Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement

The accurate estimation of three-dimensional (3D) object pose is important in a wide range of applications, such as robotics and augmented reality. The key to estimate object poses is matching feature points in the captured image with predefined ones of the 3D model of the object. Existing learning-...

Full description

Bibliographic Details
Main Authors:	Jiun-Kai You, Chen-Chien James Hsu, Wei-Yen Wang, Shao-Kang Huang
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Object pose estimation LINEMOD occlusion LINEMOD deep learning convolutional neural network
Online Access:	https://ieeexplore.ieee.org/document/9335573/

id	doaj-dd34c3c508e348f5a30f63729d4762d2
record_format	Article
spelling	doaj-dd34c3c508e348f5a30f63729d4762d22021-03-30T15:19:33ZengIEEEIEEE Access2169-35362021-01-019185971860610.1109/ACCESS.2021.30544939335573Object Pose Estimation Incorporating Projection Loss and Discriminative RefinementJiun-Kai You0Chen-Chien James Hsu1https://orcid.org/0000-0002-3697-8401Wei-Yen Wang2https://orcid.org/0000-0003-1579-8265Shao-Kang Huang3Department of Electrical Engineering, National Taiwan Normal University, Taipei, TaiwanDepartment of Electrical Engineering, National Taiwan Normal University, Taipei, TaiwanDepartment of Electrical Engineering, National Taiwan Normal University, Taipei, TaiwanDepartment of Electrical Engineering, National Taiwan Normal University, Taipei, TaiwanThe accurate estimation of three-dimensional (3D) object pose is important in a wide range of applications, such as robotics and augmented reality. The key to estimate object poses is matching feature points in the captured image with predefined ones of the 3D model of the object. Existing learning-based pose estimation systems utilize a voting strategy to estimate the feature points in a vector space for improving the accuracy of the estimated pose. However, the loss function of such approaches only takes account of the direction of the vector, resulting in an error-prone localization of feature points. Therefore, this paper considers a projection loss function dealing with the error of the vector field and incorporates a refinement network to revise the predicted pose to obtain a good final output. Experimental results show that the proposed methods outperform the state-of-the-art methods in ADD(-S) metric on the LINEMOD and Occlusion LINEMOD datasets. Moreover, the proposed method can be applied to real-world practical scenarios in real time to simultaneously estimate the poses of multiple objects.https://ieeexplore.ieee.org/document/9335573/Object pose estimationLINEMODocclusion LINEMODdeep learningconvolutional neural network
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jiun-Kai You Chen-Chien James Hsu Wei-Yen Wang Shao-Kang Huang
spellingShingle	Jiun-Kai You Chen-Chien James Hsu Wei-Yen Wang Shao-Kang Huang Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement IEEE Access Object pose estimation LINEMOD occlusion LINEMOD deep learning convolutional neural network
author_facet	Jiun-Kai You Chen-Chien James Hsu Wei-Yen Wang Shao-Kang Huang
author_sort	Jiun-Kai You
title	Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement
title_short	Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement
title_full	Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement
title_fullStr	Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement
title_full_unstemmed	Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement
title_sort	object pose estimation incorporating projection loss and discriminative refinement
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	The accurate estimation of three-dimensional (3D) object pose is important in a wide range of applications, such as robotics and augmented reality. The key to estimate object poses is matching feature points in the captured image with predefined ones of the 3D model of the object. Existing learning-based pose estimation systems utilize a voting strategy to estimate the feature points in a vector space for improving the accuracy of the estimated pose. However, the loss function of such approaches only takes account of the direction of the vector, resulting in an error-prone localization of feature points. Therefore, this paper considers a projection loss function dealing with the error of the vector field and incorporates a refinement network to revise the predicted pose to obtain a good final output. Experimental results show that the proposed methods outperform the state-of-the-art methods in ADD(-S) metric on the LINEMOD and Occlusion LINEMOD datasets. Moreover, the proposed method can be applied to real-world practical scenarios in real time to simultaneously estimate the poses of multiple objects.
topic	Object pose estimation LINEMOD occlusion LINEMOD deep learning convolutional neural network
url	https://ieeexplore.ieee.org/document/9335573/
work_keys_str_mv	AT jiunkaiyou objectposeestimationincorporatingprojectionlossanddiscriminativerefinement AT chenchienjameshsu objectposeestimationincorporatingprojectionlossanddiscriminativerefinement AT weiyenwang objectposeestimationincorporatingprojectionlossanddiscriminativerefinement AT shaokanghuang objectposeestimationincorporatingprojectionlossanddiscriminativerefinement
_version_	1724179673145933824

Object Pose Estimation Incorporating Projection Loss and Discriminative Refinement

Similar Items