To Test Machine Comprehension, Start by Defining Comprehension
Dunietz, Jesse, Burnham, Gregory, Bharadwaj, Akash, Rambow, Owen, Chu-Carroll, Jennifer, Ferrucci, David
–arXiv.org Artificial Intelligence
Many tasks aim to measure machine reading comprehension (MRC), often focusing on question types presumed to be difficult. Rarely, however, do task designers start by considering what systems should in fact comprehend. In this paper we make two key contributions. First, we argue that existing approaches do not adequately define comprehension; they are too unsystematic about what content is tested. Second, we present a detailed definition of comprehension -- a "Template of Understanding" -- for a widely useful class of texts, namely short narratives. We then conduct an experiment that strongly suggests existing systems are not up to the task of narrative understanding as we define it.
arXiv.org Artificial Intelligence
May-11-2020
- Country:
- Oceania > Australia
- North America
- United States
- Texas (0.04)
- Pennsylvania (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Maryland > Prince George's County
- College Park (0.04)
- Wisconsin > Dunn County
- Menomonie (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California > Santa Clara County
- Stanford (0.04)
- Washington > King County
- Seattle (0.04)
- Connecticut > New Haven County
- New Haven (0.04)
- Massachusetts
- Suffolk County > Boston (0.04)
- Middlesex County > Cambridge (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada > British Columbia
- United States
- Europe
- Germany > Berlin (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- Italy > Trentino-Alto Adige/Südtirol
- Trentino Province > Trento (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > China
- Hong Kong (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Education (0.89)
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area > Oncology (0.46)
- Technology: