Object manipulation in unstructured environments is important for many industrial applications where the items vary in shape, size, and material. This paper introduces a two-step pipeline for object picking, which combines instance segmentation with a heuristic based grasp point selection. The grasping points are determined using the 2D segmentation masks and depth images. A voxel-downsampling procedure reduces the depth noise, and the Theil-Sen algorithm ensures a robust linear regression for the grasping attitude determination. Unlike other methods, our approach does not require extensive training, as well as a fine labelled dataset for picking, and hence it is also independent of object shapes. Using SAM's ViT-h version and a binary object detector trained on a large dataset, our method is robust and class agnostic. The experiments, made using a RealSense D435i camera and a Racer 3 manipulator, show that our pipeline has a good success rate in simple and moderately complex scenarios, balancing computational efficiency and accuracy.
Segmentation-Based Approach for a Heuristic Grasping Procedure in Multi-Object Scenes / Ceschini, Davide; Cesare, Riccardo De; Civitelli, Enrico; Indri, Marina. - ELETTRONICO. - (2024). (Intervento presentato al convegno IEEE ETFA - IEEE International Conference on Emerging Technologies and Factory Automation tenutosi a Padova (Italy) nel 10th-13th September, 2024) [10.1109/etfa61755.2024.10711021].
Segmentation-Based Approach for a Heuristic Grasping Procedure in Multi-Object Scenes
Ceschini, Davide;Cesare, Riccardo De;Indri, Marina
2024
Abstract
Object manipulation in unstructured environments is important for many industrial applications where the items vary in shape, size, and material. This paper introduces a two-step pipeline for object picking, which combines instance segmentation with a heuristic based grasp point selection. The grasping points are determined using the 2D segmentation masks and depth images. A voxel-downsampling procedure reduces the depth noise, and the Theil-Sen algorithm ensures a robust linear regression for the grasping attitude determination. Unlike other methods, our approach does not require extensive training, as well as a fine labelled dataset for picking, and hence it is also independent of object shapes. Using SAM's ViT-h version and a binary object detector trained on a large dataset, our method is robust and class agnostic. The experiments, made using a RealSense D435i camera and a Racer 3 manipulator, show that our pipeline has a good success rate in simple and moderately complex scenarios, balancing computational efficiency and accuracy.File | Dimensione | Formato | |
---|---|---|---|
ETFA24_WIP_Comau.pdf
accesso riservato
Descrizione: Editorial version
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
3.2 MB
Formato
Adobe PDF
|
3.2 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
ETFA24_WIP_Comau_AuthorsVersion.pdf
accesso aperto
Descrizione: Authors' version
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
Pubblico - Tutti i diritti riservati
Dimensione
3.19 MB
Formato
Adobe PDF
|
3.19 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2993583