Automatic Error Analysis for Document-level Information Extraction
Das, Aliva, Du, Xinya, Wang, Barry, Shi, Kejian, Gu, Jiayuan, Porter, Thomas, Cardie, Claire
–arXiv.org Artificial Intelligence
Document-level information extraction (IE) tasks have recently begun to be revisited in earnest using the end-to-end neural network techniques that have been successful on their sentence-level IE counterparts. Evaluation of the approaches, however, has been limited in a number of dimensions. In particular, the precision/recall/F1 scores typically reported provide few insights on the range of errors the models make. We build on the work of Kummerfeld and Klein (2013) to propose a transformation-based framework for automating error analysis in document-level event and (N-ary) relation extraction. We employ our framework to compare two state-of-the-art document-level template-filling approaches on datasets from three domains; and then, to gauge progress in IE since its inception 30 years ago, vs. four systems from the MUC-4 (1992) evaluation.
arXiv.org Artificial Intelligence
Sep-15-2022
- Country:
- South America > Chile
- North America
- Honduras (0.04)
- Guatemala (0.04)
- El Salvador (0.04)
- Central America (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Virginia > Fairfax County
- McLean (0.04)
- New York > New York County
- New York City (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Colorado > Denver County
- Denver (0.04)
- California > San Diego County
- San Diego (0.04)
- Washington > King County
- Europe
- Netherlands (0.04)
- Germany (0.04)
- France (0.04)
- Czechia > Prague (0.04)
- Belgium (0.04)
- United Kingdom > England
- Greater Manchester > Manchester (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Italy
- Asia
- China > Hong Kong (0.04)
- South Korea (0.04)
- Singapore (0.04)
- Macao (0.04)
- Middle East > Qatar
- Africa > Middle East
- Morocco (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Technology: