Renaissance: Investigating the Pretraining of Vision-Language Encoders
