Does Vision-and-Language Pretraining Improve Lexical Grounding?

Open in new window