In recent years, a significant transformation towards intelligent manufacturing systems has been observed in industry. One of the leading research topics in this field is collaborative robotics, which promotes a synergic interaction between humans and robots. Advantages in ergonomics and production are foreseen with the adoption of collaborative robotics. Avoiding unintended collisions, which would ensure seamless collaboration, is one of the main challenges in improving safety and productivity. This paper focuses on a decision-making strategy that allows the robot to autonomously identify the optimal path to minimize the travel distance between the current configuration and the target while maintaining a safe distance from the human collaborator. The proposed strategy involves the offline generation of a dataset of possible paths within the robot workspace and a Reinforcement Learning-based control strategy, enabling the optimal choice of the subsequent robot configuration. After training and testing in a simulated environment, the optimal policy was validated with an ABB GoFa™ robotic arm, testing different human configurations and paths.

Combined Bi-RRT and Q-Learning path-planning in collaborative environments / Pelosi, Martina; Grieco, Bianca; Zanchettin, Andrea Maria; Rocco, Paolo. - 59 (18):(2025), pp. 289-294. (Intervento presentato al convegno 14th IFAC Symposium on Robotics tenutosi a Paris (FRA) nel July 15-18, 2025) [10.1016/j.ifacol.2025.10.235].

Combined Bi-RRT and Q-Learning path-planning in collaborative environments

Pelosi, Martina;
2025

Abstract

In recent years, a significant transformation towards intelligent manufacturing systems has been observed in industry. One of the leading research topics in this field is collaborative robotics, which promotes a synergic interaction between humans and robots. Advantages in ergonomics and production are foreseen with the adoption of collaborative robotics. Avoiding unintended collisions, which would ensure seamless collaboration, is one of the main challenges in improving safety and productivity. This paper focuses on a decision-making strategy that allows the robot to autonomously identify the optimal path to minimize the travel distance between the current configuration and the target while maintaining a safe distance from the human collaborator. The proposed strategy involves the offline generation of a dataset of possible paths within the robot workspace and a Reinforcement Learning-based control strategy, enabling the optimal choice of the subsequent robot configuration. After training and testing in a simulated environment, the optimal policy was validated with an ABB GoFa™ robotic arm, testing different human configurations and paths.
2025
File in questo prodotto:
File Dimensione Formato  
Post_print_editor.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 1.6 MB
Formato Adobe PDF
1.6 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3005150