Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges
Akter, Sanjeda; Shihab, Ibne Farabi; Sharma, Anuj
arXiv.org Artificial Intelligence
Crash detection from video feeds is a critical problem in intelligent transportation systems. Recent developments in large language models (LLMs) and vision-language models (VLMs) have transformed how we process, reason about, and summarize multimodal information. This paper surveys recent methods leveraging LLMs for crash detection from video data. We present a structured taxonomy of fusion strategies, summarize key datasets, analyze model architectures, compare performance benchmarks, and discuss ongoing challenges and opportunities. Our review provides a foundation for future research in this fast-growing intersection of video understanding and foundation models.
Sep-10-2025
- Country:
- North America > United States > Iowa > Story County > Ames (0.04)
- Genre:
- Overview (1.00)
- Industry:
- Information Technology > Security & Privacy (0.67)
- Transportation (1.00)
- Technology: