Retrieval-based Knowledge Augmented Vision Language Pre-training

Open in new window