We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/
Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language / Taioli, Francesco; Cunico, Federico; Girella, Federico; Bologna, Riccardo; Farinelli, Alessandro; Cristani, Marco. - ELETTRONICO. - (2023), pp. 4671-4676. (Intervento presentato al convegno International Conference on Computer Vision tenutosi a Paris (FRA) nel 02-06 October 2023) [10.1109/ICCVW60793.2023.00504].
Language-Enhanced RNR-Map: Querying Renderable Neural Radiance Field Maps with Natural Language
Taioli Francesco;
2023
Abstract
We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance map for Visual Navigation with natural language query prompts. The recently proposed RNR-Map employs a grid structure comprising latent codes positioned at each pixel. These latent codes, which are derived from image observation, enable: i) image rendering given a camera pose, since they are converted to Neural Radiance Field; ii) image navigation and localization with astonishing accuracy. On top of this, we enhance RNR-Map with CLIP-based embedding latent codes, allowing natural language search without additional label data. We evaluate the effectiveness of this map in single and multi-object searches. We also investigate its compatibility with a Large Language Model as an "affordance query resolver". Code and videos are available at the link https://intelligolabs.github.io/Le-RNR-Map/File | Dimensione | Formato | |
---|---|---|---|
Taioli_Language-Enhanced_RNR-Map_Querying_Renderable_Neural_Radiance_Field_Maps_with_Natural_ICCVW_2023_paper.pdf
accesso aperto
Tipologia:
2. Post-print / Author's Accepted Manuscript
Licenza:
PUBBLICO - Tutti i diritti riservati
Dimensione
933.45 kB
Formato
Adobe PDF
|
933.45 kB | Adobe PDF | Visualizza/Apri |
Language-enhanced_RNR-Map_Querying_Renderable_Neural_Radiance_Field_maps_with_natural_language.pdf
non disponibili
Tipologia:
2a Post-print versione editoriale / Version of Record
Licenza:
Non Pubblico - Accesso privato/ristretto
Dimensione
1.3 MB
Formato
Adobe PDF
|
1.3 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.
https://hdl.handle.net/11583/2982846