This article conducts a preliminary exploration of an innovative Reinforcement Learning RL-based control system applied to the Torque Allocation problem in a fully electric All-Wheel-Drive vehicle. The investigation delves into the untapped degrees of freedom in four-motor Electric Vehicles beyond total torque request and Torque Vectoring bias. Utilizing a Deep Deterministic Policy Gradient (DDPG) agent, the RL architecture is implemented within MATLAB/Simulink, incorporating co-simulation with VI-CarRealTime for vehicle dynamics. Comparative analysis against reference Torque Allocation strategies (open differential, FWD, RWD) is performed, assessing key performance factors in the reward function. The most successful RL system, trained with Second Order Sliding Mode Suboptimal torque vectoring algorithm, surpasses the average performance of reference strategies. Nevertheless, challenges such as marginal advantages, repeatability issues, prolonged training durations, and a lack of interpretability are noted.

Reinforcement learning based control for torque allocation in electric vehicles: a preliminary analysis / De Carvalho Pinheiro, H.; Carello, M.. - ELETTRONICO. - 1:(2024), pp. 1-6. (Intervento presentato al convegno 2024 International Conference on Artificial Intelligence, Computer, Data Sciences, and Applications, ACDSA 2024 tenutosi a Victoria, Seychelles nel 01-02 February 2024) [10.1109/ACDSA59508.2024.10467491].

Reinforcement learning based control for torque allocation in electric vehicles: a preliminary analysis

De Carvalho Pinheiro H.;Carello M.
2024

Abstract

This article conducts a preliminary exploration of an innovative Reinforcement Learning RL-based control system applied to the Torque Allocation problem in a fully electric All-Wheel-Drive vehicle. The investigation delves into the untapped degrees of freedom in four-motor Electric Vehicles beyond total torque request and Torque Vectoring bias. Utilizing a Deep Deterministic Policy Gradient (DDPG) agent, the RL architecture is implemented within MATLAB/Simulink, incorporating co-simulation with VI-CarRealTime for vehicle dynamics. Comparative analysis against reference Torque Allocation strategies (open differential, FWD, RWD) is performed, assessing key performance factors in the reward function. The most successful RL system, trained with Second Order Sliding Mode Suboptimal torque vectoring algorithm, surpasses the average performance of reference strategies. Nevertheless, challenges such as marginal advantages, repeatability issues, prolonged training durations, and a lack of interpretability are noted.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2988264
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo