ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots

Hsiao, Yu-Chung, Zubach, Fedir, Wang, Maria, Jindong, null, Chen, null

arXiv.org Artificial Intelligence 

We present a new task and dataset, ScreenQA, for screen content understanding via question answering. The existing screen datasets are focused either on structure and component-level understanding, or on a much higher-level composite task such as navigation and task completion. We attempt to bridge the gap between these two by annotating 80,000+ question-answer pairs over the RICO dataset in hope to benchmark the screen reading comprehension capacity.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found