GPT-Driver: Learning to Drive with GPT
Mao, Jiageng, Qian, Yuxi, Ye, Junjie, Zhao, Hang, Wang, Yue
–arXiv.org Artificial Intelligence
We present a simple yet effective approach that can transform the OpenAI GPT-3.5 model into a reliable motion planner for autonomous vehicles. Motion planning is a core challenge in autonomous driving, aiming to plan a driving trajectory that is safe and comfortable. Existing motion planners predominantly leverage heuristic methods to forecast driving trajectories, yet these approaches demonstrate insufficient generalization capabilities in the face of novel and unseen driving scenarios. In this paper, we propose a novel approach to motion planning that capitalizes on the strong reasoning capabilities and generalization potential inherent to Large Language Models (LLMs). The fundamental insight of our approach is the reformulation of motion planning as a language modeling problem, a perspective not previously explored. Specifically, we represent the planner inputs and outputs as language tokens, and leverage the LLM to generate driving trajectories through a language description of coordinate positions. With this strategy, the LLM can describe highly precise trajectory coordinates and also its internal decision-making process in natural language. We evaluate our approach on the large-scale nuScenes dataset, and extensive experiments substantiate the effectiveness, generalization ability, and interpretability of our GPT-based motion planner. Code is now available here.
arXiv.org Artificial Intelligence
Dec-5-2023
- Country:
- Asia > Middle East
- Republic of Türkiye > Karaman Province > Karaman (0.04)
- North America > United States
- California (0.14)
- Asia > Middle East
- Genre:
- Research Report > Promising Solution (0.66)
- Industry:
- Information Technology > Robotics & Automation (0.37)
- Transportation > Ground
- Road (0.52)
- Technology: