A Survey of Vision-Language Pre-Trained Models
