3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation

Open in new window