UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning
Wang, Xiangyu, Yang, Donglin, Liao, Yue, Zheng, Wenhao, wu, wenjun, Dai, Bin, Li, Hongsheng, Liu, Si
–arXiv.org Artificial Intelligence
Unmanned Aerial Vehicles (UAVs) are evolving into language-interactive platforms, enabling more intuitive forms of human-drone interaction. While prior works have primarily focused on high-level planning and long-horizon navigation, we shift attention to language-guided fine-grained trajectory control, where UAVs execute short-range, reactive flight behaviors in response to language instructions. We formalize this problem as the Flying-on-a-Word (Flow) task and introduce UAV imitation learning as an effective approach. In this framework, UAVs learn fine-grained control policies by mimicking expert pilot trajectories paired with atomic language instructions. To support this paradigm, we present UAV-Flow, the first real-world benchmark for language-conditioned, fine-grained UAV control. It includes a task formulation, a large-scale dataset collected in diverse environments, a deployable control framework, and a simulation suite for systematic evaluation. Our design enables UAVs to closely imitate the precise, expert-level flight trajectories of human pilots and supports direct deployment without sim-to-real gap. We conduct extensive experiments on UAV-Flow, benchmarking VLN and VLA paradigms. Results show that VLA models are superior to VLN baselines and highlight the critical role of spatial grounding in the fine-grained Flow setting.
arXiv.org Artificial Intelligence
May-27-2025
- Country:
- Asia
- China > Zhejiang Province
- Hangzhou (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- China > Zhejiang Province
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- Switzerland (0.04)
- Netherlands > North Holland
- North America > United States (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Aerospace & Defense (0.34)
- Information Technology (0.34)
- Technology: