VLM as Strategist: Adaptive Generation of Safety-critical Testing Scenarios via Guided Diffusion

Wu, Xinzheng, Chen, Junyi, Zhong, Naiting, Shen, Yong

Dec-3-2025–arXiv.org Artificial Intelligence

Autonomous driving technology is spearheading a transformation in the global automotive industries, and its safe and reliable implementation is the core prerequisite for large-scale adoption (Ren et al., 2025). Comprehensive testing and evaluation of autonomous driving systems (ADSs) are essential to ensuring their safety, in which the identification and generation of safety-critical scenarios represent a core challenge (Yang et al., 2025). "Safety-critical scenarios" specifically refer to rare driving situations with potentially high risks (Ding et al., 2023). Conducting tests under such scenarios enables effective evaluation of the ADSs' safety performance, as well as the clarification and iterative refinement of its Operational Design Domain (ODD). However, due to the rarity of safety-critical scenarios in naturalistic driving environments (Feng et al., 2023), real-world road testing is inefficient and cost-prohibitive, making it unsuitable for large-scale testing of high-level ADSs. As a more efficient and practical solution, simulation-based testing has garnered significant industrial and scholarly attention (Sun et al., 2022). In recent years, engineers in enterprises generally extract safety-critical testing scenarios by directly replaying vehicle-collected data in simulation environments (Liu et al., 2024), while some researchers achieve accelerated sampling of safety-critical scenarios through optimization-based search within a predefined scenario parameter space (Wu et al., 2024, 2026). However, the background vehicles (BVs) in the safety-critical testing scenarios generated by the aforementioned methods exhibit fixed behaviors and cannot dynamically respond to the actions of the vehicle under test (VUT). As a remedy, some other studies have introduced reinforcement learning to train adversarial BV driver models, thereby constructing naturalistic adversarial driving environments (NADE) (Feng et al., 2021) or evolving scenarios (Ma et al., 2024; Wu et al., 2025).

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

Dec-3-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.93)

Genre:
- Research Report (1.00)
- Workflow (0.94)

Industry:
- Automobiles & Trucks (1.00)
- Transportation > Ground
  - Road (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (1.00)
  - Robots > Autonomous Vehicles (0.88)
  - Natural Language > Large Language Model (0.69)
  - Representation & Reasoning > Agents (0.68)
  - Cognitive Science > Problem Solving (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found