DINO Pre-training for Vision-based End-to-end Autonomous Driving
Juneja, Shubham, Daniušis, Povilas, Marcinkevičius, Virginijus
–arXiv.org Artificial Intelligence
In this article, we focus on the pre-training of visual autonomous driving agents in the context of imitation learning. Current methods often rely on a classification-based pre-training, which we hypothesise to be holding back from extending capabilities of implicit image understanding. We propose pre-training the visual encoder of a driving agent using the self-distillation with no labels (DINO) method, which relies on a self-supervised learning paradigm.% and is trained on an unrelated task. Our experiments in CARLA environment in accordance with the Leaderboard benchmark reveal that the proposed pre-training is more efficient than classification-based pre-training, and is on par with the recently proposed pre-training based on visual place recognition (VPRPre).
arXiv.org Artificial Intelligence
Jul-15-2024
- Genre:
- Research Report (0.50)
- Industry:
- Automobiles & Trucks (0.89)
- Information Technology > Robotics & Automation (0.74)
- Transportation > Ground
- Road (0.89)
- Technology: