Language Models are Open Knowledge Graphs
Wang, Chenguang, Liu, Xiao, Song, Dawn
arXiv.org Artificial Intelligence
This paper shows how to construct knowledge graphs (KGs) from pre-trained language models (e.g., BERT, GPT-2/3) without human supervision. Popular KGs (e.g., Wikidata, NELL) are built in either a supervised or semi-supervised manner, requiring humans to create knowledge. Recent deep language models automatically acquire knowledge from large-scale corpora via pre-training. The stored knowledge has enabled the language models to improve downstream NLP tasks, e.g., answering questions and writing code and articles. In this paper, we propose an unsupervised method to cast the knowledge contained within language models into KGs. We show that KGs can be constructed with a single forward pass of the pre-trained language models (without fine-tuning) over the corpora. We demonstrate the quality of the constructed KGs by comparing them to two KGs (Wikidata, TAC KBP) created by humans. Our KGs also provide open factual knowledge that is new to the existing KGs. Our code and KGs will be made publicly available.
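As a rough illustration of the comparison described in the abstract, the sketch below treats a KG as a set of (head, relation, tail) triples and splits a constructed KG into facts already present in a reference KG and facts that are new. The `Triple` type, the `compare_kgs` helper, and the toy data are hypothetical and are not taken from the paper; the paper's actual extraction and evaluation procedure is not reproduced here.

```python
from __future__ import annotations

from typing import NamedTuple


class Triple(NamedTuple):
    """One fact in a knowledge graph: (head entity, relation, tail entity)."""
    head: str
    relation: str
    tail: str


def compare_kgs(constructed: set[Triple], reference: set[Triple]) -> dict:
    """Split constructed facts into those found in the reference KG and new ones.

    Facts missing from the reference are only "new" relative to it;
    they may still be correct, so precision here is a lower bound.
    """
    matched = constructed & reference
    new = constructed - reference
    precision = len(matched) / len(constructed) if constructed else 0.0
    recall = len(matched) / len(reference) if reference else 0.0
    return {"matched": matched, "new": new,
            "precision": precision, "recall": recall}


# Hypothetical toy data, for illustration only.
reference_kg = {
    Triple("Ada", "occupation", "engineer"),
    Triple("Ada", "place of birth", "Springfield"),
}
constructed_kg = {
    Triple("Ada", "occupation", "engineer"),
    Triple("Ada", "signed", "Acme Records"),
}

result = compare_kgs(constructed_kg, reference_kg)
```

Exact set intersection is the simplest possible matching rule; a real comparison would first need to map extracted entity and relation strings onto the reference KG's schema.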
Oct-22-2020
- Country:
- Africa > Middle East
- Somalia (0.04)
- Asia
- Pakistan (0.05)
- Nepal (0.04)
- Kazakhstan (0.04)
- Uzbekistan (0.04)
- Azerbaijan (0.04)
- Vietnam (0.04)
- Middle East
- Iran > Tehran Province
- Tehran (0.04)
- Iraq (0.28)
- Israel (0.04)
- Jordan (0.04)
- Lebanon (0.04)
- Republic of Türkiye (0.04)
- Syria > Damascus Governorate
- Damascus (0.04)
- Russia (0.04)
- China
- South Korea > Seoul
- Seoul (0.04)
- Kyrgyzstan (0.04)
- Turkmenistan (0.04)
- Tajikistan (0.04)
- Afghanistan (0.04)
- India > Uttar Pradesh
- Europe
- Hungary (0.04)
- Moldova (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Belgium (0.04)
- Romania (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- North Macedonia > Skopje Statistical Region
- Skopje Municipality > Skopje (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Spain > Catalonia
- Barcelona Province (0.04)
- France (0.15)
- Norway (0.04)
- United Kingdom > England
- Berkshire > Reading (0.04)
- Oxfordshire > Oxford (0.14)
- Shropshire (0.04)
- Poland > Pomerania Province
- Gdańsk (0.04)
- Denmark (0.04)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- Brandenburg > Potsdam (0.04)
- Middle East > Malta
- South Eastern Region > Southern Harbour District > Senglea (0.04)
- Bulgaria (0.04)
- Netherlands > North Holland
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Sweden > Stockholm
- Stockholm (0.05)
- North America
- Canada > Quebec
- Montreal (0.04)
- Costa Rica (0.04)
- Cuba > La Habana Province
- Havana (0.04)
- Mexico (0.04)
- United States
- Connecticut > Hartford County
- Hartford (0.04)
- California
- Los Angeles County
- Long Beach (0.04)
- Los Angeles (0.14)
- Santa Monica (0.04)
- Orange County > Irvine (0.04)
- Riverside County (0.04)
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Palo Alto (0.04)
- Pennsylvania (0.04)
- Colorado (0.04)
- Tennessee > Knox County
- Knoxville (0.04)
- Alabama > Coffee County (0.04)
- Ohio (0.04)
- New Mexico (0.04)
- Iowa (0.04)
- Virginia > Alexandria County
- Alexandria (0.04)
- Washington > Lewis County
- Centralia (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Middlesex County
- Newton (0.04)
- Maryland
- Baltimore (0.04)
- Prince George's County > Fort Washington (0.04)
- Michigan > Lenawee County
- Adrian (0.04)
- Oregon (0.04)
- New York
- Bronx County > New York City (0.04)
- Queens County > New York City (0.04)
- Westchester County > New Rochelle (0.04)
- Indiana > Lake County
- Munster (0.04)
- Texas
- Bell County > Fort Hood (0.04)
- Jefferson County > Beaumont (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- New Hampshire > Cheshire County (0.04)
- Oceania
- Australia (0.04)
- New Zealand (0.04)
- South America
- Brazil (0.04)
- Colombia
- Arauca Department > Arauca (0.04)
- Bogotá D.C. > Bogotá (0.04)
- Peru (0.04)
- Venezuela (0.14)
- Genre:
- Personal > Obituary (1.00)
- Research Report (0.81)
- Industry:
- Leisure & Entertainment > Sports
- Media
- Banking & Finance (1.00)
- Health & Medicine > Therapeutic Area
- Oncology (0.93)
- Consumer Products & Services > Hotels (0.68)
- Government
- Military > Army (0.68)
- Regional Government
- Europe Government (0.67)
- North America Government > United States Government (1.00)
- Voting & Elections (0.67)
- Education > Educational Setting (1.00)
- Transportation > Ground (0.67)
- Law > Criminal Law (1.00)
- Law Enforcement & Public Safety
- Corrections (0.67)
- Crime Prevention & Enforcement (1.00)
- Technology: