VG4D: Vision-Language Model Goes 4D Video Recognition

Open in new window