MOTIONCRAFT: Physics-based Zero-Shot Video Generation / Montanaro, A.; Savant Aira, L.; Aiello, E.; Valsesia, D.; Magli, E. - 37:(2024). (Paper presented at the 38th Conference on Neural Information Processing Systems, NeurIPS 2024, held in Vancouver (Canada), 10-15 December 2024).
MOTIONCRAFT: Physics-based Zero-Shot Video Generation
Montanaro A.; Savant Aira L.; Aiello E.; Valsesia D.; Magli E.
2024
Abstract
Generating videos with realistic and physically plausible motion is one of the main open challenges in computer vision. While diffusion models achieve compelling results in image generation, video diffusion models are limited by heavy training requirements and huge model sizes, resulting in videos that remain biased towards the training dataset. In this work we propose MotionCraft, a new zero-shot video generator to craft physics-based and realistic videos. MotionCraft warps the noise latent space of an image diffusion model, such as Stable Diffusion, by applying an optical flow derived from a physics simulation. We show that warping the noise latent space results in coherent application of the desired motion while allowing the model to generate missing elements consistent with the scene evolution, which would otherwise result in artefacts or missing content if the flow were applied in the pixel space. We compare our method with the state-of-the-art Text2Video-Zero, reporting qualitative and quantitative improvements that demonstrate the effectiveness of our approach in generating videos with finely prescribed, complex motion dynamics.
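The core operation described in the abstract, warping the diffusion model's noisy latent with a simulation-derived optical flow, can be sketched in a few lines of PyTorch. The sketch below is illustrative only: the function name `warp_latent`, the backward-warping convention, and the use of `torch.nn.functional.grid_sample` are assumptions made for exposition, not the authors' actual implementation (see the paper for the exact method).

```python
import torch
import torch.nn.functional as F

def warp_latent(latent, flow):
    """Warp a diffusion latent with a dense optical flow field.

    latent: (1, C, H, W) noisy latent from the image diffusion model.
    flow:   (1, 2, H, W) displacement field in latent-space pixels,
            e.g. a physics-simulation flow downsampled to latent resolution.
    NOTE: hypothetical sketch; MotionCraft's real warping may differ.
    """
    _, _, h, w = latent.shape
    # Pixel-coordinate grid of the target frame.
    ys, xs = torch.meshgrid(
        torch.arange(h, device=latent.device, dtype=latent.dtype),
        torch.arange(w, device=latent.device, dtype=latent.dtype),
        indexing="ij",
    )
    # Backward warping: each target pixel samples from (x - u, y - v).
    src_x = xs - flow[:, 0]
    src_y = ys - flow[:, 1]
    # Normalize to [-1, 1] as required by grid_sample (align_corners=True).
    grid = torch.stack(
        (2.0 * src_x / (w - 1) - 1.0, 2.0 * src_y / (h - 1) - 1.0), dim=-1
    )
    return F.grid_sample(latent, grid, mode="bilinear",
                         padding_mode="border", align_corners=True)

# Example: shift a random 64x64 latent three latent-pixels to the right.
z = torch.randn(1, 4, 64, 64)
flow = torch.zeros(1, 2, 64, 64)
flow[:, 0] = 3.0
z_warped = warp_latent(z, flow)
```

Because the warp is applied to the noisy latent rather than to decoded pixels, the denoising steps that follow can hallucinate the disoccluded regions, which is the behavior the abstract contrasts with pixel-space warping.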
| File | Access | Type | License | Size | Format |
|---|---|---|---|---|---|
| NeurIPS-2024-motioncraft-physics-based-zero-shot-video-generation-Paper-Conference_compressed-1.pdf | Restricted access | 2a Post-print editorial version / Version of Record | Non-public - Private/restricted access | 945.74 kB | Adobe PDF |
| Flow_based_video_generation-4_compressed.pdf | Open access | 2. Post-print / Author's Accepted Manuscript | Public - All rights reserved | 2.58 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/3000132