In recent years, a significant transformation towards intelligent manufacturing systems has been observed in industry. One of the leading research topics in this field is collaborative robotics, which promotes a synergic interaction between humans and robots. Advantages in ergonomics and production are foreseen with the adoption of collaborative robotics. Avoiding unintended collisions, which would ensure seamless collaboration, is one of the main challenges in improving safety and productivity. This paper focuses on a decision-making strategy that allows the robot to autonomously identify the optimal path to minimize the travel distance between the current configuration and the target while maintaining a safe distance from the human collaborator. The proposed strategy involves the offline generation of a dataset of possible paths within the robot workspace and a Reinforcement Learning-based control strategy, enabling the optimal choice of the subsequent robot configuration. After training and testing in a simulated environment, the optimal policy was validated with an ABB GoFa™ robotic arm, testing different human configurations and paths.
Combined Bi-RRT and Q-Learning path-planning in collaborative environments / Pelosi, Martina; Grieco, Bianca; Zanchettin, Andrea Maria; Rocco, Paolo. - 59 (18):(2025), pp. 289-294. (Intervento presentato al convegno 14th IFAC Symposium on Robotics tenutosi a Paris (FRA) nel July 15-18, 2025) [10.1016/j.ifacol.2025.10.235].
Combined Bi-RRT and Q-Learning path-planning in collaborative environments
Pelosi, Martina;
2025
Abstract
In recent years, a significant transformation towards intelligent manufacturing systems has been observed in industry. One of the leading research topics in this field is collaborative robotics, which promotes a synergic interaction between humans and robots. Advantages in ergonomics and production are foreseen with the adoption of collaborative robotics. Avoiding unintended collisions, which would ensure seamless collaboration, is one of the main challenges in improving safety and productivity. This paper focuses on a decision-making strategy that allows the robot to autonomously identify the optimal path to minimize the travel distance between the current configuration and the target while maintaining a safe distance from the human collaborator. The proposed strategy involves the offline generation of a dataset of possible paths within the robot workspace and a Reinforcement Learning-based control strategy, enabling the optimal choice of the subsequent robot configuration. After training and testing in a simulated environment, the optimal policy was validated with an ABB GoFa™ robotic arm, testing different human configurations and paths.| File | Dimensione | Formato | |
|---|---|---|---|
|
Post_print_editor.pdf
accesso aperto
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Creative commons
Dimensione
1.6 MB
Formato
Adobe PDF
|
1.6 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3005150
