A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics, and Benchmark Datasets
Zeng, Chengchang, Li, Shaobo, Li, Qin, Hu, Jie, Hu, Jianjun
arXiv.org Artificial Intelligence
Machine Reading Comprehension (MRC) is a challenging NLP research field with wide real-world applications. The great progress of this field in recent years is mainly due to the emergence of large-scale datasets and deep learning. At present, many MRC models have already surpassed human performance on many datasets, despite the obvious gap between existing MRC models and genuine human-level reading comprehension. This shows the need to improve existing datasets, evaluation metrics, and models to move MRC models toward 'real' understanding. To address the lack of a comprehensive survey of existing MRC tasks, evaluation metrics, and datasets, herein (1) we analyzed 57 MRC tasks and datasets and proposed a more precise classification method for MRC tasks with 4 different attributes; (2) we summarized 9 evaluation metrics of MRC tasks; (3) we summarized 7 attributes and 10 characteristics of MRC datasets; and (4) we discussed some open issues in MRC research and highlighted some future research directions. In addition, to help the community, we have collected, organized, and published our data on a companion website (https://mrc-datasets.github.io/) where MRC researchers can directly access each MRC dataset, along with papers and baseline projects, and browse the leaderboard.
Jun-21-2020
- Country:
- Africa > Middle East
- Somalia (0.04)
- Asia
- China > Sichuan Province
- Chengdu (0.04)
- India > Maharashtra
- Mumbai (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- Middle East
- Iran (0.04)
- Israel > Tel Aviv District
- Tel Aviv (0.04)
- Pakistan (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Bulgaria (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany > Berlin (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Uppsala County
- Uppsala (0.04)
- United Kingdom
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- Indian Ocean > Arabian Sea (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Ontario > Toronto (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- Pennsylvania
- Allegheny County > Pittsburgh (0.04)
- Philadelphia County > Philadelphia (0.04)
- Washington > King County
- Seattle (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Prince George's County
- College Park (0.04)
- South Carolina (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Texas > Travis County
- Austin (0.04)
- Oceania > Australia
- Genre:
- Overview (1.00)
- Research Report (1.00)