Jointly Training Large Autoregressive Multimodal Models

Aiello, Emanuele; Lili, Yu; Nie, Yixin; Aghajanyan, Armen; Oguz, Barlas

In recent years, advances in the large-scale pretraining of language and text-to-image models have revolutionized the field of machine learning. Yet, integrating these two modalities into a single, robust model capable of generating seamless multimodal outputs remains a significant challenge. To address this gap, we present the Joint Autoregressive Mixture (JAM) framework, a modular approach that systematically fuses existing text and image generation models. We also introduce a specialized, data-efficient instruction-tuning strategy, tailored for mixed-modal generation tasks. Our final instruct-tuned model demonstrates unparalleled performance in generating high-quality multimodal outputs and represents the first model explicitly designed for this purpose.

Jointly Training Large Autoregressive Multimodal Models / Aiello, Emanuele; Yu, Lili; Nie, Yixin; Aghajanyan, Armen; Oguz, Barlas. - ELETTRONICO. - (2023). ( International Conference on Learning Representations Vienna (Austria) Aprile 2024).

Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello;Lili Yu;Yixin Nie;Armen Aghajanyan;Barlas Oguz

2023

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Anno del prodotto

2023

Appare nelle tipologie

4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
Aiello-Jointly.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Pubblico - Tutti i diritti riservati Dimensione 1.29 MB Formato Adobe PDF Visualizza/Apri	1.29 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2990402

PORTO @ Archivio Istituzionale della Ricerca

Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello;Lili Yu;Yixin Nie;Armen Aghajanyan;Barlas Oguz

2023

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)