We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/

Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language / Taioli, Francesco; Cunico, Federico; Girella, Federico; Bologna, Riccardo; Farinelli, Alessandro; Cristani, Marco. - ELETTRONICO. - IEEE/CVF International Conference on Computer Vision Workshops (ICCVW):(In corso di stampa), pp. 4669-4674. (Intervento presentato al convegno International Conference on Computer Vision tenutosi a Paris, France nel October 2023).

Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language

Taioli Francesco;
In corso di stampa

Abstract

We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/
In corso di stampa
File in questo prodotto:
File Dimensione Formato  
Taioli_Language-Enhanced_RNR-Map_Querying_Renderable_Neural_Radiance_Field_Maps_with_Natural_ICCVW_2023_paper.pdf

non disponibili

Tipologia: 2. Post-print / Author's Accepted Manuscript
Licenza: Non Pubblico - Accesso privato/ristretto
Dimensione 933.45 kB
Formato Adobe PDF
933.45 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11583/2982846