VG4D: Vision-Language Model Goes 4D Video Recognition