Scene Graph Generation (SGG) is a powerful tool for autonomous vehicles to understand their environment. In this paper, a novel one-stage neuro-symbolic architecture called nEuro-Symbolic Relation trAnsformer (ESRA) is proposed and its applications to SGG in the field of autonomous driving are investigated. This one-stage architecture can perform both object and relation recognition in a single step, attempting to incorporate prior knowledge in the form of logical propositions grounded by a Logic Tensor Network (LTN). To the best of our knowledge, this is the first attempt to combine a transformer-based architecture with an LTN for SGG. The results show that the integration of LTN increases mean recall (mR) by up to 21% in the best configuration, with mAP achieving an increase of up to 19%.
ESRA: A Neuro-Symbolic Relation Transformer for Autonomous Driving / Russo, Alessandro; Morra, Lia; Lamberti, Fabrizio; Dimasi, PAOLO EMMANUEL ILARIO. - STAMPA. - (2024). (Intervento presentato al convegno International Joint Conference on Neural Networks (IJCNN) 2024 tenutosi a Yokohama (JPN) nel 30 June 2024 - 05 July 2024) [10.1109/IJCNN60899.2024.10651426].
ESRA: A Neuro-Symbolic Relation Transformer for Autonomous Driving
Alessandro Russo;Lia Morra;Fabrizio Lamberti;Paolo Emmanuel Ilario Dimasi
2024
Abstract
Scene Graph Generation (SGG) is a powerful tool for autonomous vehicles to understand their environment. In this paper, a novel one-stage neuro-symbolic architecture called nEuro-Symbolic Relation trAnsformer (ESRA) is proposed and its applications to SGG in the field of autonomous driving are investigated. This one-stage architecture can perform both object and relation recognition in a single step, attempting to incorporate prior knowledge in the form of logical propositions grounded by a Logic Tensor Network (LTN). To the best of our knowledge, this is the first attempt to combine a transformer-based architecture with an LTN for SGG. The results show that the integration of LTN increases mean recall (mR) by up to 21% in the best configuration, with mAP achieving an increase of up to 19%.File | Dimensione | Formato | |
---|---|---|---|
IEEE_WCCI_2024.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
2.27 MB
Formato
Adobe PDF
|
2.27 MB | Adobe PDF | Visualizza/Apri |
ESRA_a_Neuro-Symbolic_Relation_Transformer_for_Autonomous_Driving.pdf
accesso riservato
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
2.79 MB
Formato
Adobe PDF
|
2.79 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2990040