JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition / Berton, Gabriele; Trivigno, Gabriele; Caputo, Barbara; Masone, Carlo. - In: IEEE ROBOTICS AND AUTOMATION LETTERS. - ISSN 2377-3766. - 9:2(2024), pp. 1310-1317. [10.1109/LRA.2023.3339058]
JIST: Joint Image and Sequence Training for Sequential Visual Place Recognition
Berton, Gabriele; Trivigno, Gabriele; Caputo, Barbara; Masone, Carlo
2024
Abstract
Visual Place Recognition aims to recognize previously visited places from visual cues, and it is used in robotics applications such as SLAM and localization. Since a mobile robot typically has access to a continuous stream of frames, this task is naturally cast as a sequence-to-sequence localization problem. However, obtaining sequences of labelled data is much more expensive than collecting isolated images, which can be done in an automated way with little supervision. To mitigate this problem, we propose a novel Joint Image and Sequence Training (JIST) protocol that leverages large uncurated sets of images through a multi-task learning framework. With JIST we also introduce SeqGeM, an aggregation layer that revisits the popular GeM pooling to produce a single robust and compact embedding from a sequence of single-frame embeddings. We show that our model outperforms the previous state of the art while being faster, using descriptors eight times smaller, having a lighter architecture, and supporting sequences of various lengths.
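As a rough illustration of the idea behind SeqGeM, the sketch below applies generalized-mean (GeM) pooling across the sequence dimension to collapse a stack of single-frame descriptors into one compact embedding. This is only a minimal NumPy sketch under assumptions: the function name `seq_gem`, the fixed exponent `p=3.0`, and the clipping epsilon are illustrative choices, not the paper's actual implementation (where the pooling sits inside a trained network).

```python
import numpy as np

def seq_gem(frame_embeddings: np.ndarray, p: float = 3.0, eps: float = 1e-6) -> np.ndarray:
    """GeM-style pooling over the sequence dimension.

    frame_embeddings: (seq_len, dim) array of single-frame descriptors.
    Returns a single (dim,) sequence descriptor, independent of seq_len.
    """
    # GeM is defined on positive activations, so clip from below.
    x = np.clip(frame_embeddings, eps, None)
    # Generalized mean: (mean(x^p))^(1/p); p=1 is average pooling,
    # p -> inf approaches max pooling.
    return np.power(np.mean(np.power(x, p), axis=0), 1.0 / p)

# One descriptor regardless of sequence length (5 frames here, 256-D each).
sequence = np.abs(np.random.randn(5, 256))
descriptor = seq_gem(sequence)
assert descriptor.shape == (256,)
```

Because the pooling is a reduction over the sequence axis, the same layer can process sequences of any length, which matches the flexibility the abstract claims for the model.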
| File | Size | Format | License | Type | Access |
|---|---|---|---|---|---|
| JIST.pdf | 846.3 kB | Adobe PDF | Creative Commons | 2. Post-print / Author's Accepted Manuscript | Open access |
| JIST_Joint_Image_and_Sequence_Training_for_Sequential_Visual_Place_Recognition.pdf | 1.34 MB | Adobe PDF | Creative Commons | 2a Post-print, publisher's version / Version of Record | Open access |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
https://hdl.handle.net/11583/2984192