DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation

This paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose...

Full description

Bibliographic Details
Main Authors: Lei Jin, Xiaojuan Wang, Mingshu He, Jingyue Wang
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/21/5/1692
id doaj-1cbb46cba3264b068937a123f031010c
record_format Article
spelling doaj-1cbb46cba3264b068937a123f031010c2021-03-02T00:03:59ZengMDPI AGSensors1424-82202021-03-01211692169210.3390/s21051692DRNet: A Depth-Based Regression Network for 6D Object Pose EstimationLei Jin0Xiaojuan Wang1Mingshu He2Jingyue Wang3School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaThis paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose regression module combines the ROI (Region of Interest) and the original image to predict the rotation and refine the translation. Compared with previous end-to-end methods that directly predict rotations and translations, our method can utilize depth information as weak guidance and significantly reduce the searching space for the subsequent module. Furthermore, we design a new loss function function for symmetric objects, an approach that has handled such exceptionally difficult cases in prior works. Experiments show that our model achieves state-of-the-art object pose estimation for the YCB- video dataset (Yale-CMU-Berkeley).https://www.mdpi.com/1424-8220/21/5/16926Dof pose estimationrotationstranslations
collection DOAJ
language English
format Article
sources DOAJ
author Lei Jin
Xiaojuan Wang
Mingshu He
Jingyue Wang
spellingShingle Lei Jin
Xiaojuan Wang
Mingshu He
Jingyue Wang
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
Sensors
6Dof pose estimation
rotations
translations
author_facet Lei Jin
Xiaojuan Wang
Mingshu He
Jingyue Wang
author_sort Lei Jin
title DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
title_short DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
title_full DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
title_fullStr DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
title_full_unstemmed DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
title_sort drnet: a depth-based regression network for 6d object pose estimation
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2021-03-01
description This paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose regression module combines the ROI (Region of Interest) and the original image to predict the rotation and refine the translation. Compared with previous end-to-end methods that directly predict rotations and translations, our method can utilize depth information as weak guidance and significantly reduce the searching space for the subsequent module. Furthermore, we design a new loss function function for symmetric objects, an approach that has handled such exceptionally difficult cases in prior works. Experiments show that our model achieves state-of-the-art object pose estimation for the YCB- video dataset (Yale-CMU-Berkeley).
topic 6Dof pose estimation
rotations
translations
url https://www.mdpi.com/1424-8220/21/5/1692
work_keys_str_mv AT leijin drnetadepthbasedregressionnetworkfor6dobjectposeestimation
AT xiaojuanwang drnetadepthbasedregressionnetworkfor6dobjectposeestimation
AT mingshuhe drnetadepthbasedregressionnetworkfor6dobjectposeestimation
AT jingyuewang drnetadepthbasedregressionnetworkfor6dobjectposeestimation
_version_ 1724245414410977280