Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Open in new window