NaviSense: A Multimodal Assistive Mobile application for Object Retrieval by Persons with Visual Impairment
Sridhar, Ajay Narayanan, Qiao, Fuli, Aldas, Nelson Daniel Troncoso, Shi, Yanpei, Mahdavi, Mehrdad, Itti, Laurent, Narayanan, Vijaykrishnan
–arXiv.org Artificial Intelligence
People with visual impairments often face significant challenges in locating and retrieving objects in their surroundings. Existing assistive technologies present a trade-off: systems that offer precise guidance typically require pre-scanning or support only fixed object categories, while those with open-world object recognition lack spatial feedback for reaching the object. To address this gap, we introduce 'NaviSense', a mobile assistive system that combines conversational AI, vision-language models, augmented reality (AR), and LiDAR to support open-world object detection with real-time audio-haptic guidance. Users specify objects via natural language and receive continuous spatial feedback to navigate toward the target without needing prior setup. Designed with insights from a formative study and evaluated with 12 blind and low-vision participants, NaviSense significantly reduced object retrieval time and was preferred over existing tools, demonstrating the value of integrating open-world perception with precise, accessible guidance.
arXiv.org Artificial Intelligence
Sep-24-2025
- Country:
- Europe
- North America > United States
- California
- Los Angeles County > Los Angeles (0.28)
- Santa Clara County > Stanford (0.04)
- Colorado > Denver County
- Denver (0.04)
- New Jersey (0.04)
- New York > New York County
- New York City (0.04)
- Pennsylvania
- Allegheny County > Pittsburgh (0.04)
- Centre County > University Park (0.04)
- California
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks (0.47)
- Natural Language
- Chatbot (0.47)
- Large Language Model (0.70)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Communications > Mobile (1.00)
- Human Computer Interaction > Interfaces (1.00)
- Artificial Intelligence
- Information Technology