Robot Movement Using the Trust Region Policy Optimization / Ali Mouhamed Ali, Romisaa. - Electronic. - Vol. 17 (2023), pp. 394-399. (Paper presented at ICRMCA 2023: 17th International Conference on Robot Motion Control and Automation, January 16-17, 2023, Rome, Italy) [10.5281/zenodo.10045678].

Robot Movement Using the Trust Region Policy Optimization

Romisaa, Ali
2023

Abstract

The Policy Gradient approach is a subset of Deep Reinforcement Learning (DRL), which combines Deep Neural Networks (DNNs) with Reinforcement Learning (RL). It finds the optimal policy for robot movement based on the experience the agent gains from interacting with its environment. Unlike previous policy gradient algorithms, which could not handle the two types of error introduced by the DNN model through over- or underestimation, namely variance and bias, this algorithm handles both. This article discusses the state-of-the-art (SOTA) policy gradient technique Trust Region Policy Optimization (TRPO), applying it in various environments and comparing it with another policy gradient method, Proximal Policy Optimization (PPO), to illustrate their robust optimization. The SOTA method is used to gather experience data during different training phases after observing the impact of hyper-parameters on neural network performance.
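
As background (these are the standard formulations from the original TRPO and PPO papers, not equations reproduced from this article), the two objectives contrasted in the abstract can be written as follows. TRPO maximizes a surrogate advantage under a hard KL-divergence trust-region constraint:

\max_{\theta}\ \mathbb{E}_{s,a \sim \pi_{\theta_{\text{old}}}}\!\left[\frac{\pi_{\theta}(a \mid s)}{\pi_{\theta_{\text{old}}}(a \mid s)}\,\hat{A}(s,a)\right]
\quad \text{subject to} \quad
\mathbb{E}_{s}\!\left[D_{\mathrm{KL}}\big(\pi_{\theta_{\text{old}}}(\cdot \mid s)\,\big\|\,\pi_{\theta}(\cdot \mid s)\big)\right] \le \delta

PPO replaces the constraint with a clipped probability ratio r_t(\theta) = \pi_{\theta}(a_t \mid s_t) / \pi_{\theta_{\text{old}}}(a_t \mid s_t):

L^{\mathrm{CLIP}}(\theta) = \mathbb{E}_t\!\left[\min\big(r_t(\theta)\,\hat{A}_t,\ \mathrm{clip}(r_t(\theta),\,1-\epsilon,\,1+\epsilon)\,\hat{A}_t\big)\right]

The hard constraint makes TRPO's updates conservative but requires second-order machinery (conjugate gradient plus a line search), whereas PPO's clipping admits plain first-order optimization; this trade-off underlies the comparison the article describes.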
Files in this product:
File: robot-movement-using-the-trust-region-policy-optimization.pdf
Access: open access
Type: 2a Post-print editorial version / Version of Record
License: Creative Commons
Size: 397.81 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11583/2983363