Appendix: LanguageModelswithImageDescriptors areStrongFew-ShotVideo-LanguageLearners

Open in new window