The paper introduces a foundational approach to motorsports scene understanding by investigating the role of synthetic data generation in advancing scene understanding for high-speed broadcast scenarios. Utilizing the CARLA simulation environment, the study constructs a high-fidelity dataset incorporating diverse lighting conditions, occlusions, and dynamic camera perspectives to enhance model generalization. A multi-stage data refinement pipeline is introduced to mitigate the impact of extreme occlusions and irrelevant samples while preserving the complexity of real-world challenges. Possible applications include 3D real-world understanding from a single monocular 2D image, which could open up interesting possibilities for augmented reality in broadcast media by allowing seamless integration of virtual elements, interactive graphics and dynamic visual effects, enhancing storytelling, audience engagement, and production flexibility. The efficacy of the dataset is further evaluated via transfer learning to the real-world domain, with the model pretrained on synthetic data demonstrating a significantly superior performance compared to its counterpart.

A Novel Synthetic Dataset for Broadcast Motorsports Scene Understanding / Rossi, Luca Francesco; Sanna, Andrea; Manuri, Federico; Donna Bianco, Mattia. - ELETTRONICO. - (2025), pp. 48-56. (Intervento presentato al convegno AIMEDIA 2025, The First International Conference on AI-based Media Innovation tenutosi a Venice (IT) nel July 06, 2025 - July 10, 2025).

A Novel Synthetic Dataset for Broadcast Motorsports Scene Understanding

Rossi, Luca Francesco;Sanna, Andrea;Manuri, Federico;
2025

Abstract

The paper introduces a foundational approach to motorsports scene understanding by investigating the role of synthetic data generation in advancing scene understanding for high-speed broadcast scenarios. Utilizing the CARLA simulation environment, the study constructs a high-fidelity dataset incorporating diverse lighting conditions, occlusions, and dynamic camera perspectives to enhance model generalization. A multi-stage data refinement pipeline is introduced to mitigate the impact of extreme occlusions and irrelevant samples while preserving the complexity of real-world challenges. Possible applications include 3D real-world understanding from a single monocular 2D image, which could open up interesting possibilities for augmented reality in broadcast media by allowing seamless integration of virtual elements, interactive graphics and dynamic visual effects, enhancing storytelling, audience engagement, and production flexibility. The efficacy of the dataset is further evaluated via transfer learning to the real-world domain, with the model pretrained on synthetic data demonstrating a significantly superior performance compared to its counterpart.
2025
978-1-68558-330-9
File in questo prodotto:
File Dimensione Formato  
aimedia_2025_1_80_40049.pdf

accesso aperto

Descrizione: Final version
Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Pubblico - Tutti i diritti riservati
Dimensione 7.11 MB
Formato Adobe PDF
7.11 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/3001714