This work presents a privacy-preserving training framework for household characteristic identification from electricity consumption data. The proposed framework integrates two main components: (i) a synthetic data generation pipeline capable of replicating realistic energy traces from diverse family compositions, capturing fine-grained sociodemographic attributes such as household size, employment status, age groups, and home occupancy patterns; (ii) a training strategy based on Federated Learning (FL) secured with homomorphic encryption, enabling collaborative model training while preserving data ownership. Our synthetic dataset enables the performance assessment of different training scenarios, including siloed model training by individual energy utilities and secure collaboration via FL. Experimental results show that siloed training leads to inconsistent and suboptimal performance, while privacy-preserving FL achieves accuracy comparable to conventional centralized training—an ideal yet not viable option due to data regulation constraints. Our findings highlight the effectiveness of FL as a secure solution for collaborative sociodemographic profiling in smart grids.

Privacy-Preserving Federated Learning for Household Characteristic Identification / Malan, Erich; De Vizia, Claudia; Castangia, Marco; Peluso, Valentino; Calimera, Andrea; Macii, Enrico. - (2025), pp. 2040-2045. (Intervento presentato al convegno 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC) tenutosi a Toronto ON (CAN) nel 08-11 July 2025) [10.1109/compsac65507.2025.00285].

Privacy-Preserving Federated Learning for Household Characteristic Identification

Malan, Erich;De Vizia, Claudia;Castangia, Marco;Peluso, Valentino;Calimera, Andrea;Macii, Enrico
2025

Abstract

This work presents a privacy-preserving training framework for household characteristic identification from electricity consumption data. The proposed framework integrates two main components: (i) a synthetic data generation pipeline capable of replicating realistic energy traces from diverse family compositions, capturing fine-grained sociodemographic attributes such as household size, employment status, age groups, and home occupancy patterns; (ii) a training strategy based on Federated Learning (FL) secured with homomorphic encryption, enabling collaborative model training while preserving data ownership. Our synthetic dataset enables the performance assessment of different training scenarios, including siloed model training by individual energy utilities and secure collaboration via FL. Experimental results show that siloed training leads to inconsistent and suboptimal performance, while privacy-preserving FL achieves accuracy comparable to conventional centralized training—an ideal yet not viable option due to data regulation constraints. Our findings highlight the effectiveness of FL as a secure solution for collaborative sociodemographic profiling in smart grids.
2025
979-8-3315-7434-5
File in questo prodotto:
File Dimensione Formato  
2025179413 1.pdf

accesso aperto

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: Pubblico - Tutti i diritti riservati
Dimensione 644.23 kB
Formato Adobe PDF
644.23 kB Adobe PDF Visualizza/Apri
Privacy-Preserving_Federated_Learning_for_Household_Characteristic_Identification.pdf

accesso riservato

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 647.65 kB
Formato Adobe PDF
647.65 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3002806