Abstract
This paper investigates the advantages and boundaries of actor-critic reinforcement learning algorithms in an industrial setting. We compare and discuss Cycle of Learning, Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient with respect to performance in simulation as well as on a real robot setup. Furthermore, it emphasizes the importance and potential of combining demonstrated expert behavior with the actor-critic reinforcement learning setting while using it with an admittance controller to solve an industrial assembly task. Cycle of Learning and Twin Delayed Deep Deterministic Policy Gradient showed to be equally usable in simulation, while Cycle of Learning proved to be best on a real world application due to the behavior cloning loss that enables the agent to learn rapidly. The results also demonstrated that it is a necessity to incorporate an admittance controller in order to transfer the learned behavior to a real robot.
Originalsprog | Engelsk |
---|---|
Titel | 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE) |
Forlag | IEEE Computer Society |
Publikationsdato | 2022 |
Sider | 1405-1410 |
ISBN (Elektronisk) | 9781665490429 |
DOI | |
Status | Udgivet - 2022 |
Begivenhed | 18th IEEE International Conference on Automation Science and Engineering, CASE 2022 - Mexico City, Mexico Varighed: 20. aug. 2022 → 24. aug. 2022 |
Konference
Konference | 18th IEEE International Conference on Automation Science and Engineering, CASE 2022 |
---|---|
Land/Område | Mexico |
By | Mexico City |
Periode | 20/08/2022 → 24/08/2022 |
Navn | Proceedings - IEEE International Conference on Automation Science and Engineering |
---|---|
Vol/bind | 2022-August |
ISSN | 2161-8070 |
Bibliografisk note
Funding Information:*This work was supported by the PIRAT project, funded by Innovation Fund Denmark, grant number 9069-00046B. All authors are with the Maersk McKinney Moller Institute, University of Southern Denmark, Odense, Denmark
Publisher Copyright:
© 2022 IEEE.