DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation
This paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-03-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/21/5/1692 |
id |
doaj-1cbb46cba3264b068937a123f031010c |
---|---|
record_format |
Article |
spelling |
doaj-1cbb46cba3264b068937a123f031010c2021-03-02T00:03:59ZengMDPI AGSensors1424-82202021-03-01211692169210.3390/s21051692DRNet: A Depth-Based Regression Network for 6D Object Pose EstimationLei Jin0Xiaojuan Wang1Mingshu He2Jingyue Wang3School of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaThis paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose regression module combines the ROI (Region of Interest) and the original image to predict the rotation and refine the translation. Compared with previous end-to-end methods that directly predict rotations and translations, our method can utilize depth information as weak guidance and significantly reduce the searching space for the subsequent module. Furthermore, we design a new loss function function for symmetric objects, an approach that has handled such exceptionally difficult cases in prior works. Experiments show that our model achieves state-of-the-art object pose estimation for the YCB- video dataset (Yale-CMU-Berkeley).https://www.mdpi.com/1424-8220/21/5/16926Dof pose estimationrotationstranslations |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lei Jin Xiaojuan Wang Mingshu He Jingyue Wang |
spellingShingle |
Lei Jin Xiaojuan Wang Mingshu He Jingyue Wang DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation Sensors 6Dof pose estimation rotations translations |
author_facet |
Lei Jin Xiaojuan Wang Mingshu He Jingyue Wang |
author_sort |
Lei Jin |
title |
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation |
title_short |
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation |
title_full |
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation |
title_fullStr |
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation |
title_full_unstemmed |
DRNet: A Depth-Based Regression Network for 6D Object Pose Estimation |
title_sort |
drnet: a depth-based regression network for 6d object pose estimation |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2021-03-01 |
description |
This paper focuses on 6Dof object pose estimation from a single RGB image. We tackle this challenging problem with a two-stage optimization framework. More specifically, we first introduce a translation estimation module to provide an initial translation based on an estimated depth map. Then, a pose regression module combines the ROI (Region of Interest) and the original image to predict the rotation and refine the translation. Compared with previous end-to-end methods that directly predict rotations and translations, our method can utilize depth information as weak guidance and significantly reduce the searching space for the subsequent module. Furthermore, we design a new loss function function for symmetric objects, an approach that has handled such exceptionally difficult cases in prior works. Experiments show that our model achieves state-of-the-art object pose estimation for the YCB- video dataset (Yale-CMU-Berkeley). |
topic |
6Dof pose estimation rotations translations |
url |
https://www.mdpi.com/1424-8220/21/5/1692 |
work_keys_str_mv |
AT leijin drnetadepthbasedregressionnetworkfor6dobjectposeestimation AT xiaojuanwang drnetadepthbasedregressionnetworkfor6dobjectposeestimation AT mingshuhe drnetadepthbasedregressionnetworkfor6dobjectposeestimation AT jingyuewang drnetadepthbasedregressionnetworkfor6dobjectposeestimation |
_version_ |
1724245414410977280 |