Efficient Training of Energy-Based Models Using Jarzynski Equality

Carbone, Davide; Hua, Mengjian; Coste, Simon; Vanden-Eijnden, Eric

Energy-based models (EBMs) are generative models inspired by statistical physics with a wide range of applications in unsupervised learning. Their performance is well measured by the cross-entropy (CE) of the model distribution relative to the data distribution. Using the CE as the objective for training is however challenging because the computation of its gradient with respect to the model parameters requires sampling the model distribution. Here we show how results for nonequilibrium thermodynamics based on Jarzynski equality together with tools from sequential Monte-Carlo sampling can be used to perform this computation efficiently and avoid the uncontrolled approximations made using the standard contrastive divergence algorithm. Specifically, we introduce a modification of the unadjusted Langevin algorithm (ULA) in which each walker acquires a weight that enables the estimation of the gradient of the cross-entropy at any step during GD, thereby bypassing sampling biases induced by slow mixing of ULA. We illustrate these results with numerical experiments on Gaussian mixture distributions as well as the MNIST and CIFAR-10 datasets. We show that the proposed approach outperforms methods based on the contrastive divergence algorithm in all the considered situations.

Efficient Training of Energy-Based Models Using Jarzynski Equality / Carbone, Davide; Hua, Mengjian; Coste, Simon; Vanden-Eijnden, Eric. - ELETTRONICO. - (2023), pp. 1-32. (Intervento presentato al convegno 37th Conference on Neural Information Processing Systems (NeurIPS) tenutosi a New Orleans (USA) nel DEC 10-16, 2023).

Efficient Training of Energy-Based Models Using Jarzynski Equality

Carbone, Davide;Hua, Mengjian;Coste, Simon;Vanden-Eijnden, Eric

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Anno del prodotto

2023

Appare nelle tipologie

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
NeurIPS-2023-efficient-training-of-energy-based-models-using-jarzynski-equality-Paper-Conference (1).pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Pubblico - Tutti i diritti riservati Dimensione 4.45 MB Formato Adobe PDF Visualizza/Apri	4.45 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2988556

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

PORTO @ Archivio Istituzionale della Ricerca

Efficient Training of Energy-Based Models Using Jarzynski Equality

Carbone, Davide;Hua, Mengjian;Coste, Simon;Vanden-Eijnden, Eric

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)