A Modern Take on Visual Relationship Reasoning for Grasp Planning

Rabino, Paolo; Tommasi, Tatiana

doi:10.1109/LRA.2024.3524910

Interacting with real-world cluttered scenes poses several challenges to robotic agents that need to understand complex spatial dependencies among the observed objects to determine optimal pick sequences or efficient object retrieval strategies. Existing solutions typically manage simplified scenarios and focus on predicting pairwise object relationships following an initial object detection phase, but often overlook the global context or struggle with handling redundant and missing object relations. In this work, we present a modern take on visual relational reasoning for grasp planning. We introduce D3GD, a novel testbed that includes bin picking scenes with up to 35 objects from 97 distinct categories. Additionally, we propose D3G, a new end-to-end transformer-based dependency graph generation model that simultaneously detects objects and produces an adjacency matrix representing their spatial relationships. Recognizing the limitations of standard metrics, we employ the Average Precision of Relationships for the first time to evaluate model performance, conducting an extensive experimental benchmark. The obtained results establish our approach as the new state-of-the-art for this task, laying the foundation for future research in robotic manipulation. We publicly release the code and dataset at https://paolotron.github.io/d3g.github.io

A Modern Take on Visual Relationship Reasoning for Grasp Planning / Rabino, P., Tommasi, T.. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - ELETTRONICO. - 10:2(2025), pp. 1712-1719. [10.1109/LRA.2024.3524910]

A Modern Take on Visual Relationship Reasoning for Grasp Planning

Paolo Rabino;Tatiana Tommasi

2025

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2025
			
	Codice DOI
	
				https://dx.doi.org/10.1109/LRA.2024.3524910
			
	Titolo della Rivista
	
				IEEE ROBOTICS AND AUTOMATION LETTERS
			
	Appare nelle tipologie
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
A_Modern_Take_on_Visual_Relationship_Reasoning_for_Grasp_Planning.pdf accesso aperto Tipologia: 2a Post-print versione editoriale / Version of Record Licenza: Creative commons Dimensione 5.75 MB Formato Adobe PDF Visualizza/Apri	5.75 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2996380

PORTO @ Archivio Istituzionale della Ricerca

A Modern Take on Visual Relationship Reasoning for Grasp Planning

Paolo Rabino;Tatiana Tommasi

2025

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Pubblicazioni consigliate

Informazioni

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)