Object manipulation in unstructured environments is important for many industrial applications where the items vary in shape, size, and material. This paper introduces a two-step pipeline for object picking, which combines instance segmentation with a heuristic based grasp point selection. The grasping points are determined using the 2D segmentation masks and depth images. A voxel-downsampling procedure reduces the depth noise, and the Theil-Sen algorithm ensures a robust linear regression for the grasping attitude determination. Unlike other methods, our approach does not require extensive training, as well as a fine labelled dataset for picking, and hence it is also independent of object shapes. Using SAM's ViT-h version and a binary object detector trained on a large dataset, our method is robust and class agnostic. The experiments, made using a RealSense D435i camera and a Racer 3 manipulator, show that our pipeline has a good success rate in simple and moderately complex scenarios, balancing computational efficiency and accuracy.

Segmentation-Based Approach for a Heuristic Grasping Procedure in Multi-Object Scenes / Ceschini, Davide; Cesare, Riccardo De; Civitelli, Enrico; Indri, Marina. - ELETTRONICO. - (2024). (Intervento presentato al convegno IEEE ETFA - IEEE International Conference on Emerging Technologies and Factory Automation tenutosi a Padova (Italy) nel 10th-13th September, 2024) [10.1109/etfa61755.2024.10711021].

Segmentation-Based Approach for a Heuristic Grasping Procedure in Multi-Object Scenes

Ceschini, Davide;Cesare, Riccardo De;Indri, Marina
2024

Abstract

Object manipulation in unstructured environments is important for many industrial applications where the items vary in shape, size, and material. This paper introduces a two-step pipeline for object picking, which combines instance segmentation with a heuristic based grasp point selection. The grasping points are determined using the 2D segmentation masks and depth images. A voxel-downsampling procedure reduces the depth noise, and the Theil-Sen algorithm ensures a robust linear regression for the grasping attitude determination. Unlike other methods, our approach does not require extensive training, as well as a fine labelled dataset for picking, and hence it is also independent of object shapes. Using SAM's ViT-h version and a binary object detector trained on a large dataset, our method is robust and class agnostic. The experiments, made using a RealSense D435i camera and a Racer 3 manipulator, show that our pipeline has a good success rate in simple and moderately complex scenarios, balancing computational efficiency and accuracy.
2024
979-8-3503-6123-0
File in questo prodotto:
File Dimensione Formato  
ETFA24_WIP_Comau.pdf

accesso riservato

Descrizione: Editorial version
Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 3.2 MB
Formato Adobe PDF
3.2 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
ETFA24_WIP_Comau_AuthorsVersion.pdf

accesso aperto

Descrizione: Authors' version
Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: Pubblico - Tutti i diritti riservati
Dimensione 3.19 MB
Formato Adobe PDF
3.19 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2993583