Scene Graph Generation (SGG) is a powerful tool for autonomous vehicles to understand their environment. In this paper, a novel one-stage neuro-symbolic architecture called nEuro-Symbolic Relation trAnsformer (ESRA) is proposed and its applications to SGG in the field of autonomous driving are investigated. This one-stage architecture can perform both object and relation recognition in a single step, attempting to incorporate prior knowledge in the form of logical propositions grounded by a Logic Tensor Network (LTN). To the best of our knowledge, this is the first attempt to combine a transformer-based architecture with an LTN for SGG. The results show that the integration of LTN increases mean recall (mR) by up to 21% in the best configuration, with mAP achieving an increase of up to 19%.
ESRA: A Neuro-Symbolic Relation Transformer for Autonomous Driving / Russo, Alessandro; Morra, Lia; Lamberti, Fabrizio; Dimasi, PAOLO EMMANUEL ILARIO. - STAMPA. - (2024). (Intervento presentato al convegno International Joint Conference on Neural Networks (IJCNN) 2024 tenutosi a Yokohama (JPN) nel 30 June 2024 - 05 July 2024) [10.1109/IJCNN60899.2024.10651426].
ESRA: A Neuro-Symbolic Relation Transformer for Autonomous Driving
Alessandro Russo;Lia Morra;Fabrizio Lamberti;Paolo Emmanuel Ilario Dimasi
2024
Abstract
Scene Graph Generation (SGG) is a powerful tool for autonomous vehicles to understand their environment. In this paper, a novel one-stage neuro-symbolic architecture called nEuro-Symbolic Relation trAnsformer (ESRA) is proposed and its applications to SGG in the field of autonomous driving are investigated. This one-stage architecture can perform both object and relation recognition in a single step, attempting to incorporate prior knowledge in the form of logical propositions grounded by a Logic Tensor Network (LTN). To the best of our knowledge, this is the first attempt to combine a transformer-based architecture with an LTN for SGG. The results show that the integration of LTN increases mean recall (mR) by up to 21% in the best configuration, with mAP achieving an increase of up to 19%.| File | Dimensione | Formato | |
|---|---|---|---|
| IEEE_WCCI_2024.pdf accesso aperto 
											Tipologia:
											2. Post-print / Author's Accepted Manuscript
										 
											Licenza:
											
											
												Pubblico - Tutti i diritti riservati
												
												
												
											
										 
										Dimensione
										2.27 MB
									 
										Formato
										Adobe PDF
									 | 2.27 MB | Adobe PDF | Visualizza/Apri | 
| ESRA_a_Neuro-Symbolic_Relation_Transformer_for_Autonomous_Driving.pdf accesso riservato 
											Tipologia:
											2a Post-print versione editoriale / Version of Record
										 
											Licenza:
											
											
												Non Pubblico - Accesso privato/ristretto
												
												
												
											
										 
										Dimensione
										2.79 MB
									 
										Formato
										Adobe PDF
									 | 2.79 MB | Adobe PDF | Visualizza/Apri Richiedi una copia | 
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2990040
			
		
	
	
	
			      	