Generative Artificial Intelligence (GenAI) is emerging as a transformative medium for democratizing participatory urban design, potentially bridging the gap between citizens' conceptualizations and professional representations. While current GenAI tools are divided between professional-grade platforms and accessible solutions for non-experts, technical challenges persist in generating precise and contextually relevant visualizations. This research investigates the optimization of inference conditioning techniques through an open-source approach, implementing an integrated framework within ComfyUI that leverages local computing resources. The methodology combines three key enhancements: prompt engineering through Large Language Models (LLMs), fine-tuning through Low-Rank Adaptation (LoRA), and structural control through ControlNet implementations. Testing this framework on two case studies in Pisa's historical center and suburban area demonstrated how the synergistic combination of these techniques significantly improves the quality and contextual relevance of generated visualizations. Results suggest that advanced conditioning strategies can effectively balance accessibility and precision in participatory urban design tools, supporting the development of more inclusive and sustainable urban transformation processes aligned with the UN's 2030 Agenda goals.

Optimizing Inference Conditioning Techniques in Image Generation for Participatory Urban Transformation / Pupi, Enrico; Rechichi, Piergiuseppe. - ELETTRONICO. - (2025), pp. 225-235. ( EVA Berlin Conference 2025 - Electronic Media and Visual Arts Berlin (DEU) 12, 13, 14 March 2025) [10.11588/arthistoricum.1568.c24100].

Optimizing Inference Conditioning Techniques in Image Generation for Participatory Urban Transformation

Enrico Pupi;
2025

Abstract

Generative Artificial Intelligence (GenAI) is emerging as a transformative medium for democratizing participatory urban design, potentially bridging the gap between citizens' conceptualizations and professional representations. While current GenAI tools are divided between professional-grade platforms and accessible solutions for non-experts, technical challenges persist in generating precise and contextually relevant visualizations. This research investigates the optimization of inference conditioning techniques through an open-source approach, implementing an integrated framework within ComfyUI that leverages local computing resources. The methodology combines three key enhancements: prompt engineering through Large Language Models (LLMs), fine-tuning through Low-Rank Adaptation (LoRA), and structural control through ControlNet implementations. Testing this framework on two case studies in Pisa's historical center and suburban area demonstrated how the synergistic combination of these techniques significantly improves the quality and contextual relevance of generated visualizations. Results suggest that advanced conditioning strategies can effectively balance accessibility and precision in participatory urban design tools, supporting the development of more inclusive and sustainable urban transformation processes aligned with the UN's 2030 Agenda goals.
2025
978-3-98501-333-3
File in questo prodotto:
File Dimensione Formato  
Eva-Berlin-2025_Optimizing_Inference.pdf

accesso aperto

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Creative commons
Dimensione 1.13 MB
Formato Adobe PDF
1.13 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3005848