DriveGPT: Scaling Autoregressive Behavior Models for Driving

Huang, Xin, Wolff, Eric M., Vernaza, Paul, Phan-Minh, Tung, Chen, Hongge, Hayden, David S., Edmonds, Mark, Pierce, Brian, Chen, Xinxin, Jacob, Pratik Elias, Chen, Xiaobai, Tairbekov, Chingiz, Agarwal, Pratik, Gao, Tianshi, Chai, Yuning, Srinivasa, Siddhartha

Dec-18-2024–arXiv.org Artificial Intelligence

We present DriveGPT, a scalable behavior model for autonomous driving. We model driving as a sequential decision making task, and learn a transformer model to predict future agent states as tokens in an autoregressive fashion. We scale up our model parameters and training data by multiple orders of magnitude, enabling us to explore the scaling properties in terms of dataset size, model parameters, and compute. We evaluate DriveGPT across different scales in a planning task, through both quantitative metrics and qualitative examples including closed-loop driving in complex real-world scenarios. In a separate prediction task, DriveGPT outperforms a state-of-the-art baseline and exhibits improved performance by pretraining on a large-scale dataset, further validating the benefits of data scaling.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Dec-18-2024

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East > Jordan (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Energy (0.48)
- Transportation > Ground
  - Road (0.36)

Technology:
- Information Technology > Artificial Intelligence
  - Robots (1.00)
  - Machine Learning (1.00)
  - Natural Language > Large Language Model (0.70)
  - Representation & Reasoning > Agents (0.46)