DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving
Song, Ziying, Liu, Lin, Pan, Hongyu, Liao, Bencheng, Guo, Mingzhe, Yang, Lei, Zhang, Yongchang, Xu, Shaoqing, Jia, Caiyan, Luo, Yadan
–arXiv.org Artificial Intelligence
Most end-to-end autonomous driving methods rely on imitation learning from single expert demonstrations, often leading to conservative and homogeneous behaviors that limit generalization in complex real-world scenarios. In this work, we propose DIVER, an end-to-end driving framework that integrates reinforcement learning with diffusion-based generation to produce diverse and feasible trajectories. At the core of DIVER lies a reinforced diffusion-based generation mechanism. First, the model conditions on map elements and surrounding agents to generate multiple reference trajectories from a single ground-truth trajectory, alleviating the limitations of imitation learning that arise from relying solely on single expert demonstrations. Second, reinforcement learning is employed to guide the diffusion process, where reward-based supervision enforces safety and diversity constraints on the generated trajectories, thereby enhancing their practicality and generalization capability. Furthermore, to address the limitations of L2-based open-loop metrics in capturing trajectory diversity, we propose a novel Diversity metric to evaluate the diversity of multi-mode predictions.Extensive experiments on the closed-loop NAVSIM and Bench2Drive benchmarks, as well as the open-loop nuScenes dataset, demonstrate that DIVER significantly improves trajectory diversity, effectively addressing the mode collapse problem inherent in imitation learning.
arXiv.org Artificial Intelligence
Dec-10-2025
- Country:
- Asia
- China
- Beijing > Beijing (0.05)
- Hebei Province (0.04)
- Hubei Province > Wuhan (0.04)
- Liaoning Province (0.04)
- Macao (0.14)
- Singapore (0.04)
- China
- Oceania > Australia
- Queensland (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education > Educational Setting (0.93)
- Information Technology > Robotics & Automation (0.63)
- Transportation > Ground
- Road (0.87)
- Technology: