This work presents a privacy-preserving training framework for household characteristic identification from electricity consumption data. The proposed framework integrates two main components: (i) a synthetic data generation pipeline capable of replicating realistic energy traces from diverse family compositions, capturing fine-grained sociodemographic attributes such as household size, employment status, age groups, and home occupancy patterns; (ii) a training strategy based on Federated Learning (FL) secured with homomorphic encryption, enabling collaborative model training while preserving data ownership. Our synthetic dataset enables the performance assessment of different training scenarios, including siloed model training by individual energy utilities and secure collaboration via FL. Experimental results show that siloed training leads to inconsistent and suboptimal performance, while privacy-preserving FL achieves accuracy comparable to conventional centralized training—an ideal yet not viable option due to data regulation constraints. Our findings highlight the effectiveness of FL as a secure solution for collaborative sociodemographic profiling in smart grids.
Privacy-Preserving Federated Learning for Household Characteristic Identification / Malan, Erich; De Vizia, Claudia; Castangia, Marco; Peluso, Valentino; Calimera, Andrea; Macii, Enrico. - (2025), pp. 2040-2045. (Intervento presentato al convegno 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC) tenutosi a Toronto ON (CAN) nel 08-11 July 2025) [10.1109/compsac65507.2025.00285].
Privacy-Preserving Federated Learning for Household Characteristic Identification
Malan, Erich;De Vizia, Claudia;Castangia, Marco;Peluso, Valentino;Calimera, Andrea;Macii, Enrico
2025
Abstract
This work presents a privacy-preserving training framework for household characteristic identification from electricity consumption data. The proposed framework integrates two main components: (i) a synthetic data generation pipeline capable of replicating realistic energy traces from diverse family compositions, capturing fine-grained sociodemographic attributes such as household size, employment status, age groups, and home occupancy patterns; (ii) a training strategy based on Federated Learning (FL) secured with homomorphic encryption, enabling collaborative model training while preserving data ownership. Our synthetic dataset enables the performance assessment of different training scenarios, including siloed model training by individual energy utilities and secure collaboration via FL. Experimental results show that siloed training leads to inconsistent and suboptimal performance, while privacy-preserving FL achieves accuracy comparable to conventional centralized training—an ideal yet not viable option due to data regulation constraints. Our findings highlight the effectiveness of FL as a secure solution for collaborative sociodemographic profiling in smart grids.File | Dimensione | Formato | |
---|---|---|---|
2025179413 1.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
644.23 kB
Formato
Adobe PDF
|
644.23 kB | Adobe PDF | Visualizza/Apri |
Privacy-Preserving_Federated_Learning_for_Household_Characteristic_Identification.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
647.65 kB
Formato
Adobe PDF
|
647.65 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/3002806