Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering
