Expert Initialized Reinforcement Learning with Application to Robotic Assembly

Jeppe Langaa*, Christoffer Sloth*

*Kontaktforfatter

Publikation: Kapitel i bog/rapport/konference-proceedingKonferencebidrag i proceedingsForskningpeer review

Abstract

This paper investigates the advantages and boundaries of actor-critic reinforcement learning algorithms in an industrial setting. We compare and discuss Cycle of Learning, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient with respect to performance in simulation as well as on a real robot setup. Furthermore, it emphasizes the importance and potential of combining demonstrated expert behavior with the actor-critic reinforcement learning setting while using it with an admittance controller to solve an industrial assembly task. Cycle of Learning and Twin Delayed Deep Deterministic Policy Gradient showed to be equally usable in simulation, while Cycle of Learning proved to be best on a real world application due to the behavior cloning loss that enables the agent to learn rapidly. The results also demonstrated that it is a necessity to incorporate an admittance controller in order to transfer the learned behavior to a real robot.

OriginalsprogEngelsk
Titel2022 IEEE 18th International Conference on Automation Science and Engineering (CASE)
ForlagIEEE Computer Society
Publikationsdato2022
Sider1405-1410
ISBN (Elektronisk)9781665490429
DOI
StatusUdgivet - 2022
Begivenhed18th IEEE International Conference on Automation Science and Engineering, CASE 2022 - Mexico City, Mexico
Varighed: 20. aug. 202224. aug. 2022

Konference

Konference18th IEEE International Conference on Automation Science and Engineering, CASE 2022
Land/OmrådeMexico
ByMexico City
Periode20/08/202224/08/2022
NavnProceedings - IEEE International Conference on Automation Science and Engineering
Vol/bind2022-August
ISSN2161-8070

Bibliografisk note

Funding Information:
*This work was supported by the PIRAT project, funded by Innovation Fund Denmark, grant number 9069-00046B. All authors are with the Maersk McKinney Moller Institute, University of Southern Denmark, Odense, Denmark

Publisher Copyright:
© 2022 IEEE.

Fingeraftryk

Dyk ned i forskningsemnerne om 'Expert Initialized Reinforcement Learning with Application to Robotic Assembly'. Sammen danner de et unikt fingeraftryk.

Citationsformater