Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases
Weikum, Gerhard; Dong, Luna; Razniewski, Simon; Suchanek, Fabian
arXiv.org Artificial Intelligence
Equipping machines with comprehensive knowledge of the world's entities and their relationships has been a long-standing goal of AI. Over the last decade, large-scale knowledge bases, also known as knowledge graphs, have been automatically constructed from web contents and text sources, and have become a key asset for search engines. This machine knowledge can be harnessed to semantically interpret textual phrases in news, social media and web tables, and contributes to question answering, natural language processing and data analytics. This article surveys fundamental concepts and practical methods for creating and curating large knowledge bases. It covers models and methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies. On top of this, the article discusses the automatic extraction of entity-centric properties. To support the long-term life-cycle and the quality assurance of machine knowledge, the article presents methods for constructing open schemas and for knowledge curation. Case studies on academic projects and industrial knowledge graphs complement the survey of concepts and methods.
Sep-24-2020
- Country:
  - Africa > Middle East (0.04)
  - Oceania
    - Vanuatu (0.04)
    - Australia
      - New South Wales > Sydney (0.04)
      - Australian Capital Territory > Canberra (0.04)
  - North America
    - United States
      - New York (0.04)
      - Tennessee (0.04)
      - District of Columbia > Washington (0.04)
      - Hawaii (0.04)
      - Colorado (0.04)
      - Minnesota
        - St. Louis County > Duluth (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - California > Santa Clara County
        - Palo Alto (0.04)
      - Mississippi > Lee County
        - Tupelo (0.04)
      - Massachusetts > Hampshire County
        - Amherst (0.04)
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
    - Canada > Ontario
      - Toronto (0.04)
  - Europe
    - Denmark (0.04)
    - Middle East (0.04)
    - Italy (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
      - Isle of Wight (0.04)
    - Sweden > Stockholm
      - Stockholm (0.04)
    - Germany
      - Berlin (0.13)
      - Saarland > Saarbrücken (0.04)
      - Saxony > Leipzig (0.04)
    - France > Île-de-France
    - Austria > Tyrol
      - Innsbruck (0.04)
  - Asia
    - Middle East > Jordan (0.04)
    - Malaysia > Kuala Lumpur
      - Kuala Lumpur (0.04)
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.13)
    - China > Jiangsu Province
      - Nanjing (0.04)
- Genre:
  - Overview (1.00)
  - Instructional Material (1.00)
  - Research Report > New Finding (0.67)
  - Personal > Honors
    - Award (0.46)
- Industry:
  - Consumer Products & Services (1.00)
  - Law (1.00)
  - Leisure & Entertainment > Sports (1.00)
  - Retail (0.92)
  - Banking & Finance (0.65)
  - Government > Regional Government
  - Transportation > Ground
    - Road (0.67)
  - Information Technology
    - Services (1.00)
    - Security & Privacy (1.00)
  - Health & Medicine
    - Pharmaceuticals & Biotechnology (1.00)
    - Consumer Health (1.00)
  - Media
- Technology:
  - Information Technology
    - Knowledge Management > Knowledge Engineering (1.00)
    - Artificial Intelligence
      - Representation & Reasoning
        - Semantic Networks (1.00)
        - Expert Systems (1.00)
        - Uncertainty > Bayesian Inference (0.67)
      - Natural Language
        - Text Processing (1.00)
        - Information Retrieval (1.00)
        - Information Extraction (1.00)
      - Machine Learning
        - Performance Analysis > Accuracy (1.00)
        - Neural Networks > Deep Learning (1.00)
        - Statistical Learning > Clustering (0.92)
        - Learning Graphical Models
          - Undirected Networks > Markov Models (1.00)
          - Directed Networks > Bayesian Learning (0.67)