A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics, and Benchmark Datasets
Zeng, Chengchang, Li, Shaobo, Li, Qin, Hu, Jie, Hu, Jianjun
–arXiv.org Artificial Intelligence
Machine Reading Comprehension (MRC) is a challenging NLP research field with wide real world applications. The great progress of this field in recent years is mainly due to the emergence of large-scale datasets and deep learning. At present, a lot of MRC models have already surpassed the human performance on many datasets despite the obvious giant gap between existing MRC models and genuine human-level reading comprehension. This shows the need of improving existing datasets, evaluation metrics and models to move the MRC models toward 'real' understanding. To address this lack of comprehensive survey of existing MRC tasks, evaluation metrics and datasets, herein, (1) we analyzed 57 MRC tasks and datasets; proposed a more precise classification method of MRC tasks with 4 different attributes (2) we summarized 9 evaluation metrics of MRC tasks and (3) 7 attributes and 10 characteristics of MRC datasets; (4) We also discussed some open issues in MRC research and highlight some future research directions. In addition, to help the community, we have collected, organized, and published our data on a companion website(https://mrc-datasets.github.io/) where MRC researchers could directly access each MRC dataset, papers, baseline projects and browse the leaderboard.
arXiv.org Artificial Intelligence
Jun-21-2020
- Country:
- Indian Ocean > Arabian Sea (0.04)
- Oceania > Australia
- North America
- United States
- South Carolina (0.04)
- Texas > Travis County
- Austin (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Maryland > Prince George's County
- College Park (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Washington > King County
- Seattle (0.04)
- Pennsylvania
- Allegheny County > Pittsburgh (0.04)
- Philadelphia County > Philadelphia (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- Canada
- Ontario > Toronto (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- United States
- Europe
- Germany > Berlin (0.04)
- Bulgaria (0.04)
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Cambridgeshire
- Cambridge (0.04)
- Scotland > City of Edinburgh
- Sweden > Uppsala County
- Uppsala (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Pakistan (0.04)
- Middle East
- Iran (0.04)
- Israel > Tel Aviv District
- Tel Aviv (0.04)
- Japan > Kyūshū & Okinawa
- Kyūshū > Miyazaki Prefecture > Miyazaki (0.04)
- India > Maharashtra
- Mumbai (0.04)
- China > Sichuan Province
- Chengdu (0.04)
- Africa > Middle East
- Somalia (0.04)
- Genre:
- Research Report (1.00)
- Overview (1.00)
- Industry:
- Technology: