Iterative Document-level Information Extraction via Imitation Learning
Chen, Yunmo, Gantt, William, Gu, Weiwei, Chen, Tongfei, White, Aaron Steven, Van Durme, Benjamin
–arXiv.org Artificial Intelligence
We present a novel iterative extraction model, IterX, for extracting complex relations, or templates (i.e., N-tuples representing a mapping from named slots to spans of text) within a document. Documents may feature zero or more instances of a template of any given type, and the task of template extraction entails identifying the templates in a document and extracting each template's slot values. Our imitation learning approach casts the problem as a Markov decision process (MDP), and relieves the need to use predefined template orders to train an extractor. It leads to state-of-the-art results on two established benchmarks -- 4-ary relation extraction on SciREX and template extraction on MUC-4 -- as well as a strong baseline on the new BETTER Granular task.
arXiv.org Artificial Intelligence
May-1-2023
- Country:
- Asia (1.00)
- Europe (1.00)
- North America
- Canada (0.67)
- United States > Minnesota (0.28)
- Genre:
- Research Report > New Finding (0.46)
- Workflow (0.67)
- Industry:
- Government
- Immigration & Customs (0.67)
- Regional Government (0.93)
- Law Enforcement & Public Safety (0.67)
- Government
- Technology: