Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

Stranieri, Francesco; Stella, Fabio; Kouki, Chaaben

doi:10.1080/00207543.2024.2311180

This study conducts a comprehensive analysis of deep reinforcement learning (DRL) algorithms applied to supply chain inventory management (SCIM), which consists of a sequential decision-making problem focussed on determining the optimal quantity of products to produce and ship across multiple capacitated local warehouses over a specific time horizon. In detail, we formulate the problem as a Markov decision process for a divergent two-echelon inventory control system characterised by stochastic and seasonal demand, also presenting a balanced allocation rule designed to prevent backorders in the first echelon. Through numerical experiments, we evaluate the performance of state-of-the-art DRL algorithms and static inventory policies in terms of both cost minimisation and training time while varying the number of local warehouses and product types and the length of replenishment lead times. Our results reveal that the Proximal Policy Optimization algorithm consistently outperforms other algorithms across all experiments, proving to be a robust method for tackling the SCIM problem. Furthermore, the (s, Q)-policy stands as a solid alternative, offering a compromise in performance and computational efficiency. Lastly, this study presents an open-source software library that provides a customisable simulation environment for addressing the SCIM problem, utilising a wide range of DRL algorithms and static inventory policies.

Performance of deep reinforcement learning algorithms in two-echelon inventory control systems / Stranieri, Francesco; Stella, Fabio; Kouki, Chaaben. - In: INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH. - ISSN 0020-7543. - ELETTRONICO. - 62:17(2024), pp. 6211-6226. [10.1080/00207543.2024.2311180]

Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

Stranieri, Francesco;Stella, Fabio;Kouki, Chaaben

2024

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2024
			
	Codice DOI
	
				https://dx.doi.org/10.1080/00207543.2024.2311180
			
	Titolo della Rivista
	
				INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Performance of deep reinforcement learning algorithms in two-echelon inventory control systems.pdf accesso riservato Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Non Pubblico - Accesso privato/ristretto Dimensione 2.14 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.14 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
Accepted_Performance_of Deep_ Reinforcement_Learning_Algorithms_in_Two-Echelon_Inventory_Control_Systems.pdf Open Access dal 02/03/2025 Tipologia: 2. Post-print / Author's Accepted Manuscript Licenza: Pubblico - Tutti i diritti riservati Dimensione 849.56 kB Formato Adobe PDF Visualizza/Apri	849.56 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2986524

PORTO @ Archivio Istituzionale della Ricerca

Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

Stranieri, Francesco;Stella, Fabio;Kouki, Chaaben

2024

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)