GLEN: General-Purpose Event Detection for Thousands of Types
Qiusi Zhan, Sha Li, Kathryn Conger, Martha Palmer, Heng Ji, Jiawei Han
arXiv.org Artificial Intelligence
The progress of event extraction research has been hindered by the absence of wide-coverage, large-scale datasets. To make event extraction systems more accessible, we build a general-purpose event detection dataset, GLEN, which covers 205K event mentions with 3,465 different types, making its ontology more than 20x larger than that of today's largest event dataset. GLEN is created by utilizing the DWD Overlay, which provides a mapping between Wikidata Qnodes and PropBank rolesets. This enables us to use the abundant existing annotation for PropBank as distant supervision. We also propose a new multi-stage event detection model, CEDAR, specifically designed to handle the large ontology size in GLEN. We show that our model exhibits superior performance compared to a range of baselines, including InstructGPT. Finally, we perform error analysis and show that label noise is still the largest challenge for improving performance on this new dataset. Our dataset, code, and models are released at https://github.com/ZQS1943/GLEN.
Oct-31-2023
- Country:
- Africa > Middle East (0.04)
- Asia
- China
- Middle East
- Israel (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Europe
- France
- Italy > Tuscany
- Florence (0.04)
- Middle East (0.04)
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- United States
- California > Orange County
- San Juan Capistrano (0.04)
- Colorado
- Boulder County > Boulder (0.04)
- Denver County > Denver (0.04)
- Illinois > Champaign County
- Urbana (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New Mexico (0.04)
- Virginia > Norfolk City County
- Norfolk (0.04)
- Washington > King County
- Seattle (0.04)
- Oceania > Australia
- Genre:
- Research Report (0.50)
- Industry:
- Government > Military (0.46)
- Health & Medicine (1.00)
- Law (1.00)
- Technology: