ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots
Hsiao, Yu-Chung, Zubach, Fedir, Wang, Maria, Jindong, null, Chen, null
–arXiv.org Artificial Intelligence
We present a new task and dataset, ScreenQA, for screen content understanding via question answering. The existing screen datasets are focused either on structure and component-level understanding, or on a much higher-level composite task such as navigation and task completion. We attempt to bridge the gap between these two by annotating 80,000+ question-answer pairs over the RICO dataset in hope to benchmark the screen reading comprehension capacity.
arXiv.org Artificial Intelligence
Sep-16-2022
- Country:
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Italy > Tuscany
- Florence (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Western Europe (0.04)
- Belgium > Brussels-Capital Region
- North America
- Dominican Republic (0.04)
- United States
- California > Los Angeles County
- Long Beach (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York > New York County
- New York City (0.05)
- Texas > Travis County
- Austin (0.04)
- Utah > Salt Lake County
- Salt Lake City (0.04)
- California > Los Angeles County
- South America > Chile
- Europe
- Genre:
- Research Report (0.40)
- Technology: