Advancing Code Generation from Visual Designs through Transformer-Based Architectures and Specialized Datasets / Calò, Tommaso; De Russis, Luigi. - In: PROCEEDINGS OF THE ACM ON HUMAN-COMPUTER INTERACTION. - ISSN 2573-0142. - (In press). [10.1145/3734190]
Advancing Code Generation from Visual Designs through Transformer-Based Architectures and Specialized Datasets
Calò, Tommaso; De Russis, Luigi
In press
Abstract
Manually translating web designs into code is a costly and time-consuming process, particularly due to the frequent iterations and refinements between designers and developers. Deep learning techniques, which can automatically translate designs into functional code using an encoder-decoder architecture, have emerged as a promising way to streamline this tedious process. However, many current methods depend on simplistic datasets that do not capture the diversity of components found in modern websites. Additionally, the potential of transformer-based models, which have enabled significant progress in vision and language modeling tasks thanks to their scalability and ability to handle cross-modal relationships, has not been investigated in this context. Addressing these limitations, this paper contributes: 1) a web scraping methodology to automatically collect and process a diverse dataset of real-world websites with reduced noise and complexity, 2) a synthetic dataset of webpage mockups along with their sketched conversions, and 3) an evaluation of two recent multimodal transformer architectures on these proposed datasets. Results on the synthetic and sketch-based datasets demonstrate the architectures' potential as effective design-to-code automation solutions, while identifying remaining challenges in modeling real-world website complexity.

File | Size | Format
---|---|---
Advancing Code Generation from Visual Designs through Transformer-Based Architectures and Specialized Datasets.pdf (Type: 1. Preprint / submitted version [pre-review]; License: Non-public, private/restricted access) | 2.34 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/2999868