CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs

Lykov, Artem, Serpiva, Valerii, Khan, Muhammad Haris, Sautenkov, Oleg, Myshlyaev, Artyom, Tadevosyan, Grik, Yaqoot, Yasheerah, Tsetserukou, Dzmitry

Mar-3-2025–arXiv.org Artificial Intelligence

CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive T ask Solving and Reasoning in UA Vs Artem Lykov, V alerii Serpiva, Muhammad Haris Khan, Oleg Sautenkov, Artyom Myshlyaev, Grik Tadevosyan, Y asheerah Y aqoot, and Dzmitry Tsetserukou Abstract -- This paper introduces CognitiveDrone, a novel Vision-Language-Action (VLA) model tailored for complex Unmanned Aerial V ehicles (UA Vs) tasks that demand advanced cognitive abilities. Trained on a dataset comprising over 8,000 simulated flight trajectories across three key categories--Human Recognition, Symbol Understanding, and Reasoning--the model generates real-time 4D action commands based on first-person visual inputs and textual instructions. T o further enhance performance in intricate scenarios, we propose CognitiveDrone-R1, which integrates an additional Vision-Language Model (VLM) reasoning module to simplify task directives prior to high-frequency control. Experimental evaluations using our open-source benchmark, CognitiveDroneBench, reveal that while a racing-oriented model (RaceVLA) achieves an overall success rate of 31.3%, the base CognitiveDrone model reaches 59.6%, and CognitiveDrone-R1 attains a success rate of 77.2%. These results demonstrate improvements of up to 30% in critical cognitive tasks, underscoring the effectiveness of incorporating advanced reasoning capabilities into UA V control systems. Our contributions include the development of a state-of-the-art VLA model for UA V control and the introduction of the first dedicated benchmark for assessing cognitive tasks in drone operations.

available, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

Mar-3-2025

arXiv.org PDF

Add feedback

Country:
- Europe > Netherlands (0.14)

Genre:
- Research Report > New Finding (0.34)

Industry:
- Government > Military (0.34)
- Health & Medicine
  - Consumer Health (0.34)
  - Therapeutic Area
    - Neurology (0.34)
    - Psychiatry/Psychology (0.34)
- Information Technology > Robotics & Automation (0.48)
- Transportation > Air (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Problem Solving (0.57)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)
  - Natural Language > Large Language Model (0.69)
  - Representation & Reasoning (1.00)
  - Robots > Autonomous Vehicles
    - Drones (0.67)