WildQA: In-the-Wild Video Question Answering

Castro, Santiago, Deng, Naihao, Huang, Pingxuan, Burzo, Mihai, Mihalcea, Rada

Sep-14-2022–arXiv.org Artificial Intelligence

Existing video understanding datasets mostly focus on human interactions, with little attention being paid to the "in the wild" settings, where the videos are recorded outdoors. We propose WILDQA, a video understanding dataset of videos recorded in outside settings. In addition to video question answering (Video QA), we also introduce the new task of identifying visual support for a given question and answer (Video Evidence Selection). Through evaluations using a wide range of baseline models, we show that WILDQA poses new challenges to the vision and language research communities. The dataset is available at https://lit.eecs.umich.edu/wildqa/.

natural language, question answering, wildqa, (1 more...)

arXiv.org Artificial Intelligence

Sep-14-2022

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.60)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found