DINO Pre-training for Vision-based End-to-end Autonomous Driving
Juneja, Shubham, Daniušis, Povilas, Marcinkevičius, Virginijus
–arXiv.org Artificial Intelligence
In this article, we focus on the pre-training of visual autonomous driving agents in the context of imitation learning. Current methods often rely on a classification-based pre-training, which we hypothesise to be holding back from extending capabilities of implicit image understanding. We propose pre-training the visual encoder of a driving agent using the self-distillation with no labels (DINO) method, which relies on a self-supervised learning paradigm.% and is trained on an unrelated task. Our experiments in CARLA environment in accordance with the Leaderboard benchmark reveal that the proposed pre-training is more efficient than classification-based pre-training, and is on par with the recently proposed pre-training based on visual place recognition (VPRPre).
arXiv.org Artificial Intelligence
Jul-15-2024
- Country:
- North America > United States
- Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > Lithuania
- Vilnius County > Vilnius (0.04)
- Kaunas County > Kaunas (0.04)
- North America > United States
- Genre:
- Research Report (0.50)
- Industry:
- Automobiles & Trucks (0.89)
- Information Technology > Robotics & Automation (0.74)
- Transportation > Ground
- Road (0.89)
- Technology: