Using Semantics and Statistics to Turn Data into Knowledge

Pujara, Jay (University of Maryland, College Park) | Miao, Hui (University of California, Santa Cruz) | Getoor, Lise (Carnegie Mellon University) | Cohen, William W.

AI Magazine 

Many information extraction and knowledge base construction systems are addressing the challenge of deriving knowledge from text. In this article, we represent the desired knowledge base as a knowledge graph and introduce the problem of knowledge graph identification, collectively resolving the entities, labels, and relations present in the knowledge graph. Knowledge graph identification requires reasoning jointly over millions of extractions simultaneously, posing a scalability challenge to many approaches. We use probabilistic soft logic (PSL), a recently-introduced statistical relational learning framework, to implement an efficient solution to knowledge graph identification and present state-of-the-art results for knowledge graph construction while performing an order of magnitude faster than competing methods.