A Modular Object Detection System for Humanoid Robots Using YOLO
Pottier, Nicolas, Lau, Meng Cheng
arXiv.org Artificial Intelligence
Within the field of robotics, computer vision remains a significant barrier to progress, with many tasks hindered by inefficient vision systems. This research proposes a generalized vision module leveraging YOLOv9, a state-of-the-art framework optimized for computationally constrained environments like robots. The model is trained on a dataset tailored to the FIRA robotics HuroCup. A new vision module is implemented in ROS1 using a virtual environment to enable YOLO compatibility. Performance is evaluated using metrics such as frames per second (FPS) and mean average precision (mAP), and is then compared to the existing geometric framework in static and dynamic contexts. The YOLO model achieved comparable precision at a higher computational cost than the geometric model, while providing improved robustness.
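For readers unfamiliar with the two metrics named in the abstract, the following is a minimal sketch of how they are commonly computed. It is not the authors' evaluation code: the function names (`iou`, `average_precision`, `frames_per_second`), the box format `(x1, y1, x2, y2)`, and the 0.5 IoU threshold are all assumptions made for illustration, and the AP shown covers a single class on a single image. mAP would be the mean of such AP values over all classes, normally accumulated across a whole dataset.

```python
import time


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(detections, ground_truth, iou_threshold=0.5):
    """AP for one class. detections: list of (confidence, box);
    ground_truth: list of boxes. Threshold of 0.5 is an assumption."""
    if not ground_truth:
        return 0.0
    matched = set()
    hits = []  # True = true positive, in descending confidence order
    for _, box in sorted(detections, key=lambda d: d[0], reverse=True):
        best_j, best_iou = -1, 0.0
        # Simplification: greedily match to the best still-unmatched box.
        for j, gt in enumerate(ground_truth):
            if j in matched:
                continue
            overlap = iou(box, gt)
            if overlap > best_iou:
                best_j, best_iou = j, overlap
        if best_iou >= iou_threshold:
            matched.add(best_j)
            hits.append(True)
        else:
            hits.append(False)

    # Precision and recall after each detection.
    precisions, recalls, tp = [], [], 0
    for k, hit in enumerate(hits, start=1):
        tp += int(hit)
        precisions.append(tp / k)
        recalls.append(tp / len(ground_truth))

    # Make precision non-increasing (right-to-left running maximum).
    for i in range(len(precisions) - 2, -1, -1):
        precisions[i] = max(precisions[i], precisions[i + 1])

    # Area under the precision-recall curve.
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def frames_per_second(process_frame, frames):
    """Average throughput of process_frame over a list of frames."""
    start = time.perf_counter()
    for frame in frames:
        process_frame(frame)
    elapsed = time.perf_counter() - start
    return len(frames) / elapsed if elapsed > 0 else float("inf")
```

In this sketch, `process_frame` stands in for whatever callable runs one inference step (a YOLO forward pass or a geometric detector), so the same timing helper could be applied to both pipelines being compared.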
Oct-16-2025
- Genre:
- Research Report (0.83)
- Industry:
- Health & Medicine (0.46)
- Information Technology (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Vision (1.00)
- Robots (1.00)
- Machine Learning
- Neural Networks > Deep Learning (0.95)
- Performance Analysis (0.88)