DriveGPT: Scaling Autoregressive Behavior Models for Driving
Huang, Xin, Wolff, Eric M., Vernaza, Paul, Phan-Minh, Tung, Chen, Hongge, Hayden, David S., Edmonds, Mark, Pierce, Brian, Chen, Xinxin, Jacob, Pratik Elias, Chen, Xiaobai, Tairbekov, Chingiz, Agarwal, Pratik, Gao, Tianshi, Chai, Yuning, Srinivasa, Siddhartha
–arXiv.org Artificial Intelligence
We present DriveGPT, a scalable behavior model for autonomous driving. We model driving as a sequential decision making task, and learn a transformer model to predict future agent states as tokens in an autoregressive fashion. We scale up our model parameters and training data by multiple orders of magnitude, enabling us to explore the scaling properties in terms of dataset size, model parameters, and compute. We evaluate DriveGPT across different scales in a planning task, through both quantitative metrics and qualitative examples including closed-loop driving in complex real-world scenarios. In a separate prediction task, DriveGPT outperforms a state-of-the-art baseline and exhibits improved performance by pretraining on a large-scale dataset, further validating the benefits of data scaling.
arXiv.org Artificial Intelligence
Dec-18-2024
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Energy (0.48)
- Transportation > Ground
- Road (0.36)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language > Large Language Model (0.70)
- Representation & Reasoning > Agents (0.46)
- Robots (1.00)
- Information Technology > Artificial Intelligence