A new benchmark for pose estimation with ground truth from virtual reality

Christian Schlette*, Anders Glent Buch, Eren Erdal Aksoy, Thomas Steil, Jérémie Papon, Thiusius Rajeeth Savarimuthu, Florentin Wörgötter, Norbert Krüger, Jürgen Roßmann

*Corresponding author

Publication: Contribution to journal, Journal article, Research, peer-reviewed

Abstract

The development of programming paradigms for industrial assembly currently gets fresh impetus from approaches in human demonstration and programming-by-demonstration. Major low- and mid-level prerequisites for machine vision and learning in these intelligent robotic applications are pose estimation, stereo reconstruction and action recognition. As a basis for the machine vision and learning involved, pose estimation is used for deriving object positions and orientations and thus target frames for robot execution. Our contribution introduces and applies a novel benchmark for typical multi-sensor setups and algorithms in the field of demonstration-based automated assembly. The benchmark platform is equipped with a multi-sensor setup consisting of stereo cameras and depth scanning devices (see Fig. 1). The dimensions and abilities of the platform have been chosen in order to reflect typical manual assembly tasks. Following the eRobotics methodology, a simulatable 3D representation of this platform was modelled in virtual reality. Based on a detailed camera and sensor simulation, we generated a set of benchmark images and point clouds with controlled levels of noise as well as ground truth data such as object positions and time stamps. We demonstrate the application of the benchmark to evaluate our latest developments in pose estimation, stereo reconstruction and action recognition and publish the benchmark data for objective comparison of sensor setups and algorithms in industry.

Original language: English
Journal: Production Engineering
Volume: 8
Issue number: 6
Pages (from-to): 745-754
ISSN: 0944-6524
DOI
Status: Published - 2014
