This paper proposes a methodology for optimizing HVAC control in multi-zone buildings using Deep Reinforce- ment Learning. The study focuses on optimizing the central AHU system by controlling all low-level components within both the air and water loops, addressing the complex dynamics of multi-zone interactions. The case study is based on a building within the Politecnico di Torino campus. Modelica-based simulations are used to model both the HVAC system and building dynamics, allowing the integration and evaluation of the ASHRAE G36 con- trol standard as a benchmark. Two DRL strategies are developed and evaluated, Zone-Aware and Zone-Integrated, under both winter and summer conditions, with the goal of improving energy efficiency, indoor temperature con- trol, and indoor CO2 concentration, under varying occupancy profiles. The results reveal that both DRL strategies outperform the G36 baseline in terms of energy savings (up to 17 %), indoor temperature violations, and CO2 concentration. Additionally, DRL controllers demonstrate strong generalizability and adapt seamlessly to unseen occupancy profiles without manual tuning. This research highlights the potential of DRL to provide scalable, adaptive, and energy-efficient HVAC control solutions for multi-zone buildings.

Deploying deep reinforcement learning for low-level HVAC control in multi-zone buildings: A comparative study with ASHRAE G36 sequences / Savino, Sabrina; Razzano, Giuseppe; Pagone, Michele; Novara, Carlo; Capozzoli, Alfonso. - In: ENERGY AND BUILDINGS. - ISSN 0378-7788. - 348:(2025). [10.1016/j.enbuild.2025.116456]

Deploying deep reinforcement learning for low-level HVAC control in multi-zone buildings: A comparative study with ASHRAE G36 sequences

Savino, Sabrina;Razzano, Giuseppe;Pagone, Michele;Novara, Carlo;Capozzoli, Alfonso
2025

Abstract

This paper proposes a methodology for optimizing HVAC control in multi-zone buildings using Deep Reinforce- ment Learning. The study focuses on optimizing the central AHU system by controlling all low-level components within both the air and water loops, addressing the complex dynamics of multi-zone interactions. The case study is based on a building within the Politecnico di Torino campus. Modelica-based simulations are used to model both the HVAC system and building dynamics, allowing the integration and evaluation of the ASHRAE G36 con- trol standard as a benchmark. Two DRL strategies are developed and evaluated, Zone-Aware and Zone-Integrated, under both winter and summer conditions, with the goal of improving energy efficiency, indoor temperature con- trol, and indoor CO2 concentration, under varying occupancy profiles. The results reveal that both DRL strategies outperform the G36 baseline in terms of energy savings (up to 17 %), indoor temperature violations, and CO2 concentration. Additionally, DRL controllers demonstrate strong generalizability and adapt seamlessly to unseen occupancy profiles without manual tuning. This research highlights the potential of DRL to provide scalable, adaptive, and energy-efficient HVAC control solutions for multi-zone buildings.
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0378778825011867-main_compressed.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 1.95 MB
Formato Adobe PDF
1.95 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3003430