Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
