OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

Sirko-Galouchenko, Sophia, Boulch, Alexandre, Gidaris, Spyros, Bursuc, Andrei, Vobecky, Antonin, Pérez, Patrick, Marlet, Renaud

Jun-12-2024–arXiv.org Artificial Intelligence

We introduce a self-supervised pretraining method, called OccFeat, for camera-only Bird's-Eye-View (BEV) segmentation networks. With OccFeat, we pretrain a BEV network via occupancy prediction and feature distillation tasks. Occupancy prediction provides a 3D geometric understanding of the scene to the model. However, the geometry learned is class-agnostic. Hence, we add semantic information to the model in the 3D space through distillation from a self-supervised pretrained image foundation model. Models pretrained with our method exhibit improved BEV semantic segmentation performance, particularly in low-data scenarios. Moreover, empirical results affirm the efficacy of integrating feature distillation with 3D occupancy prediction in our pretraining approach. Repository: https://github.com/valeoai/Occfeat

detection, occfeat, representation, (14 more...)

arXiv.org Artificial Intelligence

Jun-12-2024

arXiv.org PDF

Add feedback

Country:
- Europe
  - Czechia > Prague (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)

Genre:
- Research Report (0.64)

Industry:
- Education (0.66)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.70)
  - Artificial Intelligence
    - Vision (1.00)
    - Machine Learning > Neural Networks (0.68)
    - Natural Language > Text Processing (0.66)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found