QuASH: Using Natural-Language Heuristics to Query Visual-Language Robotic Maps
Pekkanen, Matti, Verdoja, Francesco, Kyrki, Ville
arXiv.org Artificial Intelligence
Embeddings from Visual-Language Models are increasingly used to represent semantics in robotic maps, offering open-vocabulary scene understanding that surpasses traditional, limited label sets. These embeddings enable on-demand querying: an embedded user text prompt is compared to the map embeddings via a similarity metric. The key challenge in performing the task indicated in a query is that the robot must determine which parts of the environment are relevant to the query. This paper proposes a solution to this challenge. We leverage natural-language synonyms and antonyms associated with the query within the embedding space, apply heuristics to estimate the region of the language space relevant to the query, and use that estimate to train a classifier that partitions the environment into matches and non-matches. We evaluate our method through extensive experiments, querying both maps and standard image benchmarks. The results demonstrate increased queryability of maps and images. Our querying technique is agnostic to the representation and encoder used, and requires limited training.
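The querying idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the 3-D vectors below are hand-made stand-ins for encoder embeddings, the word lists are invented for illustration, and plain logistic regression is an assumed choice of classifier. The sketch shows (1) similarity-based querying with cosine similarity and (2) training a classifier on the query plus synonyms (positives) and antonyms (negatives) to split map cells into matches and non-matches.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def cosine(a, b):
    """Cosine similarity between two vectors."""
    a, b = normalize(a), normalize(b)
    return sum(x * y for x, y in zip(a, b))

def train_logistic(pos, neg, epochs=500, lr=0.5):
    """Fit logistic regression by batch gradient descent.
    pos: embeddings of the query and its synonyms (label 1)
    neg: embeddings of the antonyms (label 0)"""
    data = [(normalize(v), 1.0) for v in pos] + [(normalize(v), 0.0) for v in neg]
    dim = len(data[0][0])
    w, b = [0.0] * dim, 0.0
    for _ in range(epochs):
        gw, gb = [0.0] * dim, 0.0
        for x, y in data:
            z = sum(wi * xi for wi, xi in zip(w, x)) + b
            err = 1.0 / (1.0 + math.exp(-z)) - y
            gw = [g + err * xi for g, xi in zip(gw, x)]
            gb += err
        w = [wi - lr * g / len(data) for wi, g in zip(w, gw)]
        b -= lr * gb / len(data)
    return w, b

def is_match(w, b, x):
    """Classify one embedding as match (True) or non-match (False)."""
    x = normalize(x)
    return sum(wi * xi for wi, xi in zip(w, x)) + b > 0.0

# Hypothetical toy embeddings; a real system would use a text/image encoder.
query = [1.0, 0.1, 0.0]                               # "soft"
synonyms = [[0.9, 0.2, 0.1], [0.8, 0.0, 0.2]]         # "plush", "cushioned"
antonyms = [[-0.9, 0.1, 0.1], [-1.0, 0.2, 0.0]]       # "hard", "rigid"
cells = {                                             # map-cell embeddings
    "sofa": [0.7, 0.3, 0.1],
    "table": [-0.8, 0.3, 0.0],
    "pillow": [0.95, 0.0, 0.1],
}

# (1) Plain similarity querying: score each cell against the query.
scores = {name: cosine(query, emb) for name, emb in cells.items()}

# (2) Classifier-based querying: learn a boundary from synonyms and antonyms.
w, b = train_logistic([query] + synonyms, antonyms)
matches = [name for name, emb in cells.items() if is_match(w, b, emb)]
print(scores)
print(matches)
```

With plain similarity scores, deciding what counts as a match still requires choosing a cut-off; in this sketch the classifier's decision boundary plays that role, which is one way to read the "matches and non-matches" partition described above.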
Oct-17-2025
- Country:
  - Asia
    - China
      - Shandong Province > Qingdao (0.04)
      - Zhejiang Province > Hangzhou (0.04)
    - Japan > Honshū
      - Chūbu > Ishikawa Prefecture > Kanazawa (0.04)
    - Middle East
      - Israel > Tel Aviv District
        - Tel Aviv (0.04)
      - UAE > Abu Dhabi Emirate
        - Abu Dhabi (0.04)
    - South Korea > Daegu
      - Daegu (0.04)
  - Europe
    - Finland (0.04)
    - France > Île-de-France
    - Italy > Lombardy
      - Milan (0.04)
    - United Kingdom > England
      - Greater London > London (0.04)
  - North America
    - Canada > British Columbia
      - Vancouver (0.04)
    - United States
      - California > Los Angeles County
        - Long Beach (0.04)
      - Georgia > Fulton County
        - Atlanta (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
- Genre:
  - Research Report > New Finding (0.34)
- Technology:
  - Information Technology > Artificial Intelligence
    - Natural Language > Large Language Model (0.98)
    - Robots (1.00)
    - Vision (1.00)