We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/

Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language / Taioli, Francesco; Cunico, Federico; Girella, Federico; Bologna, Riccardo; Farinelli, Alessandro; Cristani, Marco. - ELETTRONICO. - (2023), pp. 4671-4676. (Intervento presentato al convegno International Conference on Computer Vision tenutosi a Paris (FRA) nel 02-06 October 2023) [10.1109/ICCVW60793.2023.00504].

Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language

Taioli Francesco;
2023

Abstract

We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/
2023
979-8-3503-0744-3
File in questo prodotto:
File Dimensione Formato  
Taioli_Language-Enhanced_RNR-Map_Querying_Renderable_Neural_Radiance_Field_Maps_with_Natural_ICCVW_2023_paper.pdf

accesso aperto

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: PUBBLICO - Tutti i diritti riservati
Dimensione 933.45 kB
Formato Adobe PDF
933.45 kB Adobe PDF Visualizza/Apri
Language-enhanced_RNR-Map_Querying_Renderable_Neural_Radiance_Field_maps_with_natural_language.pdf

non disponibili

Tipologia: 2a Post-print versione editoriale / Version of Record
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 1.3 MB
Formato Adobe PDF
1.3 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2982846