Vi-LAD: Vision-Language Attention Distillation for Socially-Aware Robot Navigation in Dynamic Environments
Elnoor, Mohamed, Weerakoon, Kasun, Seneviratne, Gershom, Liang, Jing, Rajagopal, Vignesh, Manocha, Dinesh
–arXiv.org Artificial Intelligence
-- We introduce Vision-Language Attention Distillation (Vi-LAD), a novel approach for distilling socially compliant navigation knowledge from a large Vision-Language Model (VLM) into a lightweight transformer model for real-time robotic navigation. Unlike traditional methods that rely on expert demonstrations or human-annotated datasets, Vi-LAD performs knowledge distillation and fine-tuning at the intermediate layer representation level (i.e., attention maps) by leveraging the backbone of a pre-trained vision-action model. These attention maps highlight key navigational regions in a given scene, which serve as implicit guidance for socially aware motion planning. Vi-LAD fine-tunes a transformer-based model using intermediate attention maps extracted from the pre-trained vision-action model, combined with attention-like semantic maps constructed from a large VLM. T o achieve this, we introduce a novel attention-level distillation loss that fuses knowledge from both sources, generating augmented attention maps with enhanced social awareness. These refined attention maps are then utilized as a traversability costmap within a socially aware model predictive controller (MPC) for navigation. We validate our approach through real-world experiments on a Husky wheeled robot, demonstrating significant improvements over state-of-the-art (SOT A) navigation methods. Our results show up to 14.2% - 50% improvement in success rate, which highlights the effectiveness of Vi-LAD in enabling socially compliant and efficient robot navigation. I NTRODUCTION As autonomous robots become increasingly integrated into human-centered environments, ensuring safe, efficient, and socially compliant navigation is a critical challenge [1].
arXiv.org Artificial Intelligence
Mar-12-2025
- Country:
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- Genre:
- Overview > Innovation (0.34)
- Research Report
- New Finding (0.54)
- Promising Solution (0.34)
- Technology: