Language Models are Open Knowledge Graphs
Wang, Chenguang, Liu, Xiao, Song, Dawn
arXiv.org Artificial Intelligence
This paper shows how to construct knowledge graphs (KGs) from pre-trained language models (e.g., BERT, GPT-2/3) without human supervision. Popular KGs (e.g., Wikidata, NELL) are built in either a supervised or semi-supervised manner, requiring humans to create knowledge. Recent deep language models automatically acquire knowledge from large-scale corpora via pre-training. The stored knowledge has enabled the language models to improve performance on downstream NLP tasks, e.g., answering questions and writing code and articles. In this paper, we propose an unsupervised method to cast the knowledge contained within language models into KGs. We show that KGs can be constructed with a single forward pass of the pre-trained language models (without fine-tuning) over the corpora. We demonstrate the quality of the constructed KGs by comparing them to two KGs (Wikidata, TAC KBP) created by humans. Our KGs also provide open factual knowledge that is new to the existing KGs. Our code and KGs will be made publicly available.
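As a rough illustration of the comparison step mentioned in the abstract, the sketch below represents a KG as a set of (head, relation, tail) triples and measures how a constructed KG overlaps with a human-created reference KG, while also listing facts that are absent from the reference. This is a minimal sketch and not the authors' evaluation protocol: the `Triple` type, the `compare_kgs` function, the exact-match criterion, and the toy triples are all assumptions made for illustration.

```python
from typing import Dict, NamedTuple, Set


class Triple(NamedTuple):
    """One KG fact as (head, relation, tail). Hypothetical type for this sketch."""
    head: str
    relation: str
    tail: str


def compare_kgs(candidate: Set[Triple], reference: Set[Triple]) -> Dict[str, object]:
    """Compare a constructed KG against a reference KG by exact triple match.

    Assumption: two facts are equal only if all three fields are identical.
    A real comparison would first need entity and relation linking.
    """
    shared = candidate & reference   # facts found in both graphs
    new = candidate - reference      # facts only in the constructed graph
    precision = len(shared) / len(candidate) if candidate else 0.0
    recall = len(shared) / len(reference) if reference else 0.0
    return {"shared": shared, "new": new, "precision": precision, "recall": recall}


# Made-up placeholder triples, not taken from the paper or from any real KG.
candidate = {
    Triple("entity_a", "located_in", "entity_b"),
    Triple("entity_a", "founded_by", "entity_c"),
}
reference = {
    Triple("entity_a", "located_in", "entity_b"),
    Triple("entity_d", "part_of", "entity_b"),
}

report = compare_kgs(candidate, reference)
print(report["precision"], report["recall"])
print(sorted(report["new"]))
```

Here the `new` set plays the role of the open facts that the reference KG does not contain; whether such facts are correct would still need separate checking.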
Oct-22-2020
- Country:
- South America
- Venezuela (0.14)
- Peru (0.04)
- Brazil (0.04)
- Colombia
- Bogotá D.C. > Bogotá (0.04)
- Arauca Department > Arauca (0.04)
- Oceania
- New Zealand (0.04)
- Australia (0.04)
- North America
- Mexico (0.04)
- Costa Rica (0.04)
- United States
- New Hampshire > Cheshire County (0.04)
- Ohio (0.04)
- Alabama > Coffee County (0.04)
- Pennsylvania (0.04)
- Oregon (0.04)
- Iowa (0.04)
- New Mexico (0.04)
- Colorado (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Texas
- Jefferson County > Beaumont (0.04)
- Bell County > Fort Hood (0.04)
- Indiana > Lake County
- Munster (0.04)
- New York
- Bronx County > New York City (0.04)
- Westchester County > New Rochelle (0.04)
- Queens County > New York City (0.04)
- Michigan > Lenawee County
- Adrian (0.04)
- Maryland
- Baltimore (0.04)
- Prince George's County > Fort Washington (0.04)
- Massachusetts > Middlesex County
- Newton (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Washington > Lewis County
- Centralia (0.04)
- Virginia > Alexandria County
- Alexandria (0.04)
- Tennessee > Knox County
- Knoxville (0.04)
- California
- Santa Clara County > Palo Alto (0.04)
- Orange County > Irvine (0.04)
- Riverside County (0.04)
- San Francisco County > San Francisco (0.04)
- Los Angeles County
- Los Angeles (0.14)
- Santa Monica (0.04)
- Long Beach (0.04)
- Connecticut > Hartford County
- Hartford (0.04)
- Cuba > La Habana Province
- Havana (0.04)
- Canada > Quebec
- Montreal (0.04)
- Europe
- France (0.15)
- Bulgaria (0.04)
- Denmark (0.04)
- Norway (0.04)
- Romania (0.04)
- Hungary (0.04)
- Belgium (0.04)
- Moldova (0.04)
- Sweden > Stockholm
- Stockholm (0.05)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Netherlands > North Holland
- Middle East > Malta
- South Eastern Region > Southern Harbour District > Senglea (0.04)
- Germany
- Brandenburg > Potsdam (0.04)
- Bavaria > Upper Bavaria
- Munich (0.04)
- Poland > Pomerania Province
- Gdańsk (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.14)
- Shropshire (0.04)
- Berkshire > Reading (0.04)
- Spain > Catalonia
- Barcelona Province (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Asia
- Pakistan (0.05)
- Azerbaijan (0.04)
- Russia (0.04)
- Afghanistan (0.04)
- Vietnam (0.04)
- Nepal (0.04)
- Tajikistan (0.04)
- Turkmenistan (0.04)
- Kyrgyzstan (0.04)
- Uzbekistan (0.04)
- Kazakhstan (0.04)
- India > Uttar Pradesh
- South Korea > Seoul
- Seoul (0.04)
- China
- Middle East
- Iraq (0.28)
- Lebanon (0.04)
- Jordan (0.04)
- Israel (0.04)
- Republic of Türkiye (0.04)
- Syria > Damascus Governorate
- Damascus (0.04)
- Iran > Tehran Province
- Tehran (0.04)
- Africa > Middle East
- Somalia (0.04)
- South America
- Genre:
- Personal > Obituary (1.00)
- Research Report (0.81)
- Industry:
- Law > Criminal Law (1.00)
- Education > Educational Setting (1.00)
- Banking & Finance (1.00)
- Consumer Products & Services > Hotels (0.68)
- Transportation > Ground (0.67)
- Law Enforcement & Public Safety
- Crime Prevention & Enforcement (1.00)
- Corrections (0.67)
- Government
- Military > Army (0.68)
- Voting & Elections (0.67)
- Regional Government
- North America Government > United States Government (1.00)
- Europe Government (0.67)
- Health & Medicine > Therapeutic Area
- Oncology (0.93)
- Media
- Leisure & Entertainment > Sports
- Technology: