Probing the Gaps in ChatGPT Live Video Chat for Real-World Assistance for People who are Blind or Visually Impaired
Chang, Ruei-Che, Natalie, Rosiana, Xu, Wenqian, Yap, Jovan Zheng Feng, Guo, Anhong
–arXiv.org Artificial Intelligence
Recent advancements in large multimodal models have provided blind or visually impaired (BVI) individuals with new capabilities to interpret and engage with the real world through interactive systems that utilize live video feeds. However, the potential benefits and challenges of such capabilities to support diverse real-world assistive tasks remain unclear. In this paper, we present findings from an exploratory study with eight BVI participants. Participants used ChatGPT's Advanced Voice with Video, a state-of-the-art live video AI released in late 2024, in various real-world scenarios, from locating objects to recognizing visual landmarks, across unfamiliar indoor and outdoor environments. Our findings indicate that current live video AI effectively provides guidance and answers for static visual scenes but falls short in delivering essential live descriptions required in dynamic situations. Despite inaccuracies in spatial and distance information, participants leveraged the provided visual information to supplement their mobility strategies. Although the system was perceived as human-like due to high-quality voice interactions, assumptions about users' visual abilities, hallucinations, generic responses, and a tendency towards sycophancy led to confusion, distrust, and potential risks for BVI users. Based on the results, we discuss implications for assistive video AI agents, including incorporating additional sensing capabilities for real-world use, determining appropriate intervention timing beyond turn-taking interactions, and addressing ecological and safety concerns.
arXiv.org Artificial Intelligence
Aug-6-2025
- Country:
- Asia
- Japan > Honshū
- Kantō > Kanagawa Prefecture > Yokohama (0.04)
- Middle East > Jordan (0.04)
- Japan > Honshū
- Europe
- France > Île-de-France
- Germany > Hamburg (0.04)
- Greece
- Attica > Athens (0.04)
- Ionian Islands > Corfu (0.04)
- United Kingdom > Scotland
- City of Glasgow > Glasgow (0.04)
- North America
- Canada
- Newfoundland and Labrador > Newfoundland
- St. John's (0.04)
- Quebec > Montreal (0.04)
- Newfoundland and Labrador > Newfoundland
- Puerto Rico > Peñuelas
- Peñuelas (0.04)
- United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- New York > New York County
- New York City (0.16)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > San Jose (0.04)
- Texas > Bexar County
- San Antonio (0.04)
- New Jersey (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Colorado > Denver County
- Denver (0.15)
- Michigan > Washtenaw County
- Ann Arbor (0.14)
- Pennsylvania > Allegheny County
- Canada
- Asia
- Genre:
- Personal > Interview (0.93)
- Research Report > New Finding (1.00)
- Industry:
- Technology: