A Language Agent for Autonomous Driving
Mao, Jiageng, Ye, Junjie, Qian, Yuxi, Pavone, Marco, Wang, Yue
–arXiv.org Artificial Intelligence
Human-level driving is an ultimate goal of autonomous driving. Conventional approaches formulate autonomous driving as a perception-prediction-planning framework, yet their systems do not capitalize on the inherent reasoning ability and experiential knowledge of humans. In this paper, we propose a fundamental paradigm shift from current pipelines, exploiting Large Language Models (LLMs) as a cognitive agent to integrate human-like intelligence into autonomous driving systems. Our approach, termed Agent-Driver, transforms the traditional autonomous driving pipeline by introducing a versatile tool library accessible via function calls, a cognitive memory of common sense and experiential knowledge for decision-making, and a reasoning engine capable of chain-of-thought reasoning, task planning, motion planning, and self-reflection. Powered by LLMs, our Agent-Driver is endowed with intuitive common sense and robust reasoning capabilities, thus enabling a more nuanced, human-like approach to autonomous driving. We evaluate our approach on the large-scale nuScenes benchmark, and extensive experiments substantiate that our Agent-Driver significantly outperforms the state-of-the-art driving methods by a large margin. Our approach also demonstrates superior interpretability and few-shot learning ability to these methods. Code will be released.
arXiv.org Artificial Intelligence
Nov-27-2023
- Country:
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Asia > Middle East
- Republic of Türkiye > Karaman Province > Karaman (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Information Technology > Robotics & Automation (1.00)
- Automobiles & Trucks (1.00)
- Transportation > Ground
- Road (1.00)
- Technology: