IELM: An Open Information Extraction Benchmark for Pre-Trained Language Models
Wang, Chenguang, Liu, Xiao, Song, Dawn
arXiv.org Artificial Intelligence
We introduce a new open information extraction (OIE) benchmark for pre-trained language models (LMs). Recent studies have demonstrated that pre-trained LMs, such as BERT and GPT, may store linguistic and relational knowledge. In particular, LMs are able to answer "fill-in-the-blank" questions when given a pre-defined relation category. Instead of focusing on pre-defined relations, we create an OIE benchmark that aims to fully examine the open relational information present in pre-trained LMs. We accomplish this by turning pre-trained LMs into zero-shot OIE systems. Surprisingly, pre-trained LMs obtain competitive performance on both standard OIE datasets (CaRB and Re-OIE2016) and two new large-scale factual OIE datasets (TAC KBP-OIE and Wikidata-OIE) that we establish via distant supervision. For instance, the zero-shot pre-trained LMs achieve higher F1 scores than state-of-the-art supervised OIE methods on our factual OIE datasets without using any training sets. Our code and datasets are available at https://github.com/cgraywang/IELM
Oct-25-2022
- Country:
- Africa > Middle East
- Somalia (0.04)
- Asia
- China
- India > Uttar Pradesh
- Lucknow (0.04)
- Middle East
- Iran > Tehran Province
- Tehran (0.04)
- Iraq (0.28)
- Israel (0.04)
- Jordan (0.04)
- Lebanon (0.04)
- Republic of Türkiye (0.04)
- Nepal (0.04)
- Pakistan (0.05)
- Russia (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Vietnam (0.04)
- Europe
- Hungary (0.04)
- Moldova (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Belgium (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Spain > Catalonia
- Barcelona Province (0.04)
- Italy (0.04)
- France (0.15)
- Norway (0.04)
- United Kingdom > England
- Berkshire > Reading (0.04)
- Shropshire (0.04)
- Poland > Pomerania Province
- Gdańsk (0.04)
- Denmark (0.04)
- Netherlands (0.04)
- Middle East > Malta
- South Eastern Region > Southern Harbour District > Senglea (0.04)
- Bulgaria (0.04)
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- Costa Rica (0.04)
- Cuba > La Habana Province
- Havana (0.04)
- Mexico (0.04)
- United States
- Connecticut > Hartford County
- Hartford (0.04)
- California
- Los Angeles County
- Long Beach (0.04)
- Los Angeles (0.14)
- Santa Monica (0.04)
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Palo Alto (0.14)
- Pennsylvania (0.04)
- Colorado (0.04)
- Tennessee > Knox County
- Knoxville (0.04)
- Alabama > Coffee County (0.04)
- Ohio (0.04)
- New Mexico (0.04)
- Iowa (0.04)
- Virginia > Alexandria County
- Alexandria (0.04)
- Washington > Lewis County
- Centralia (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Middlesex County
- Newton (0.04)
- Maryland
- Baltimore (0.04)
- Prince George's County > Fort Washington (0.04)
- Michigan > Lenawee County
- Adrian (0.04)
- Oregon (0.04)
- New York
- Bronx County > New York City (0.04)
- Queens County > New York City (0.04)
- Westchester County > New Rochelle (0.04)
- Indiana > Lake County
- Munster (0.04)
- Texas
- Bell County > Fort Hood (0.04)
- Jefferson County > Beaumont (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Oceania
- Australia (0.04)
- New Zealand (0.04)
- South America
- Colombia
- Arauca Department > Arauca (0.04)
- Bogotá D.C. > Bogotá (0.04)
- Peru (0.04)
- Genre:
- Personal > Obituary (1.00)
- Research Report > New Finding (1.00)
- Industry:
- Leisure & Entertainment > Sports
- Media
- Transportation (0.93)
- Banking & Finance (1.00)
- Education (1.00)
- Health & Medicine > Therapeutic Area
- Oncology (0.93)
- Government
- Military > Army (0.68)
- Regional Government > North America Government
- United States Government (1.00)
- Law > Criminal Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Technology: