E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

Open in new window