Learning 6D Object Pose Estimation using 3D Object Coordinates
Authors:
Eric Brachmann, Alexander Krull, Frank Michel, Stefan Gumhold, Jamie Shotton, Carsten Rother
Abstract:
This work addresses the problem of estimating the 6D Pose of specific objects from a single RGB-D image. We present a flexible approach that can deal with generic objects, both textured and texture-less. The key new concept is a learned, intermediate representation in form of a dense 3D object coordinate labelling paired with a dense class labelling. We are able to show that for a common dataset with texture-less objects, where template-based techniques are suitable and state of the art, our approach is slightly superior in terms of accuracy. We also demonstrate the benefits of our approach, compared to template-based techniques, in terms of robustness with respect to varying lighting conditions. Towards this end, we contribute a new ground truth dataset with 10k images of 20 objects captured each under three different lighting conditions. We demonstrate that our approach scales well with the number of objects and has capabilities to run fast.
Results:
![Pose estimation result on our dataset.](https://tu-dresden.de/ing/informatik/smt/cgv/ressourcen/bilder/forschung/forschungsfelder/poseest/poseestresult1/@@images/840cc33d-0c7d-41d4-b635-64d7dbddea3a.png)
Two qualitative pose estimation results on our dataset. Left: Input RGB-D frames with estimated pose displayed as blue bounding box, ground truth pose as green bounding box. Right: Object coordinate prediction of one tree. The upper inlay shows the ground truth object coordinates. The lower inlay shows for each pixel the best object coordinate prediction of all trees with respect to ground truth.
Dataset:
Coming soon...
Publication:
Eric Brachmann, Alexander Krull, Frank Michel, Stefan Gumhold, Jamie Shotton, and Carsten Rother, "Learning 6D Object Pose Estimation using 3D Object Coordinates", Supplementary Material, ECCV 2014.