Pre-Training Multi-Modal Dense Retrievers for Outside-Knowledge Visual Question Answering
Salemi, Alireza, Rafiee, Mahta, Zamani, Hamed
arXiv.org Artificial Intelligence
This paper studies a category of visual question answering tasks in which accessing external knowledge is necessary for answering the questions. This category is called outside-knowledge visual question answering (OK-VQA). A major step in developing OK-VQA systems is retrieving relevant documents for a given multi-modal query. The current state-of-the-art asymmetric dense retrieval model for this task uses an architecture with a multi-modal query encoder and a uni-modal document encoder. Such an architecture requires a large amount of training data to perform effectively. We propose an automatic data generation pipeline for pre-training passage retrieval models for OK-VQA tasks. The proposed approach leads to a 26.9% improvement in Precision@5 compared to the current state-of-the-art asymmetric architecture. Additionally, the proposed pre-training approach performs well in zero-shot retrieval scenarios.
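For readers unfamiliar with the Precision@5 metric mentioned in the abstract, the following is a minimal sketch of how Precision@k is commonly computed for a single query. The function name, passage IDs, and relevance set are hypothetical and for illustration only; the paper's exact evaluation protocol is not described in this listing.

```python
def precision_at_k(ranked_ids, relevant_ids, k=5):
    """Return the fraction of the top-k retrieved passages that are relevant.

    ranked_ids:   passage IDs ordered from most to least similar to the query
    relevant_ids: IDs of passages judged relevant for the query
    k:            cutoff rank (5 for Precision@5)
    """
    if k <= 0:
        raise ValueError("k must be positive")
    relevant = set(relevant_ids)
    # Count relevant passages among the first k results.
    hits = sum(1 for pid in ranked_ids[:k] if pid in relevant)
    # Dividing by k (not by the number of results returned) is the usual
    # convention, so a short result list is penalized.
    return hits / k


# Hypothetical ranking for one query: two of the top five passages are relevant.
ranking = ["p3", "p7", "p1", "p9", "p4", "p8"]
relevant = {"p1", "p4", "p8"}
score = precision_at_k(ranking, relevant, k=5)
```

In practice the per-query scores are averaged over all queries in the evaluation set to give a single Precision@5 figure.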
Jun-28-2023
- Country:
  - Oceania > Australia
  - North America
    - Dominican Republic (0.04)
    - United States
      - Washington > King County
        - Seattle (0.04)
      - Texas > Travis County
        - Austin (0.04)
      - Pennsylvania > Philadelphia County
        - Philadelphia (0.04)
      - New York > New York County
        - New York City (0.05)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Massachusetts > Hampshire County
        - Amherst (0.04)
      - Maryland > Montgomery County
        - Gaithersburg (0.04)
  - Europe
    - Switzerland (0.04)
    - Ukraine > Kyiv Oblast
      - Kyiv (0.04)
    - Spain
      - Galicia > Madrid (0.04)
      - Catalonia > Barcelona Province
        - Barcelona (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
  - Asia
    - Taiwan > Taiwan Province
      - Taipei (0.05)
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.04)
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)
    - China
      - Hong Kong (0.04)
      - Tianjin Province > Tianjin (0.04)
- Genre:
  - Research Report > Experimental Study (0.46)
- Technology: