Multimodal Causal Reasoning for UAVObject Detection
–Neural Information Processing Systems
Unmanned Aerial Vehicle (UAV) object detection faces significant challenges due to complex environmental conditions and different imaging conditions. These factors introduce significant changes in scale and appearance, particularly for small objects that occupy limited pixels and exhibit limited information, complicating detection tasks. To address these challenges, we propose a Multimodel Causal Reasoning framework based on YOLO backbone for UAVObject Detection (MCR-UOD). The key idea is to use the backdoor adjustment to discover the condition-invariant object representation for easy detection. Specifically, the YOLO backbone is first adjusted to incorporate the pre-trained vision-language model.
Neural Information Processing Systems
Jun-23-2026, 03:10:47 GMT
- Country:
- Asia (0.28)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Health & Medicine (0.67)
- Media (0.46)
- Information Technology > Robotics & Automation (0.34)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (1.00)
- Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Representation & Reasoning > Model-Based Reasoning (0.61)
- Robots > Autonomous Vehicles
- Drones (0.66)
- Machine Learning
- Performance Analysis > Accuracy (1.00)
- Neural Networks (0.68)
- Information Technology