Country
Approximate Lifted Belief Propagation
Singla, Parag (University of Texas) | Nath, Aniruddh (University of Washington) | Domingos, Pedro (University of Washington)
Lifting can greatly reduce the cost of inference on first-order probabilistic models, but constructing the lifted network can itself be quite costly. In addition, the minimal lifted network is often very close in size to the fully propositionalized model; lifted inference yields little or no speedup in these situations. In this paper, we address both these problems. We propose a compact hypercube-based representation for the lifted network, which can greatly reduce the cost of lifted network construction. We also present two methods for approximate lifted network construction, which groups together similar but distinguishable objects and treats them as if they were identical. This can greatly reduce the size of the lifted network as well as the time required for lifted network construction, but potentially at some cost to accuracy. The coarseness of the approximation can be adjusted depending on the accuracy required, and we can bound the resulting error. Experiments on six domains show great efficiency gains with only minor loss in accuracy.
An Architectural Approach to Statistical Relational AI
Rosenbloom, Paul (University of Southern California)
The architectural approach to AI focuses on the fixed structure underlying intelligence. Applying it to statistical relational AI should further stimulate the application of statistical relational techniques across AI, while focusing research on their commonalities, (in)compatibilities and integration. It could also yield new architectures that are simpler yet more comprehensive than todayโs best.
Declarative Probabilistic Programming for Undirected Graphical Models: Open Up to Scale Up
Riedel, Sebastian Robert (University of Massachusetts)
We argue that probabilistic programming with undirected models, in order to scale up, needs to open up. That is, instead of focusing on minimal sets of generic building blocks such as universal quantification or logical connectives, languages should grow to include specific building blocks for as many uses cases as necessary. This can not only lead to more concise models, but also to more efficient inference if we use methods that can exploit building-block specific sub-routines. As embodiment of this paradigm we present , a platform for implementing probabilistic programming languages that grow.
Bayesian Abductive Logic Programs
Raghavan, Sindhu V. (The University of Texas at Austin) | Mooney, Raymond J. (The University of Texas at Austin)
In this paper, we introduce Bayesian Abductive Logic Programs (BALPs), a new formalism that integrates Bayesian Logic Programs (BLPs) and Abductive Logic Programming (ALP) for abductive reasoning. Like BLPs, BALPs also combine first-order logic and Bayesian networks. However, unlike BLPs that use logical deduction to construct Bayes nets, BALPs employ logical abduction. As a result, BALPs are more suited for solving problems like plan/activity recognition and diagnosis that require abductive reasoning. First, we present the necessary enhancements to BLPs in order to support logical abduction. Next, we apply BALPs to the task of plan recognition and demonstrate its efficacy on two data sets. We also compare the performance of BALPs with several existing approaches for abduction.
Machine Reading: A "Killer App" for Statistical Relational AI
Poon, Hoifung (University of Washington) | Domingos, Pedro (University of Washington)
Machine reading aims to automatically extract knowledge from text. It is a long-standing goal of AI and holds the promise of revolutionizing Web search and other fields. In this paper, we analyze the core challenges of machine reading and show that statistical relational AI is particularly well suited to address these challenges. We then propose a unifying approach to machine reading in which statistical relational AI plays a central role. Finally, we demonstrate the promise of this approach by presenting OntoUSP, an end-to-end machine reading system that builds on recent advances in statistical relational AI and greatly outperforms state-of-the-art systems in a task of extracting knowledge from biomedical abstracts and answering questions.
Integrating Structured Metadata with Relational Affinity Propagation
Plangprasopchok, Anon (University of Southern California/Information Sciences Institute) | Lerman, Kristina (University of Southern California/Information Sciences Institute) | Getoor, Lise (University of Maryland, College Park)
Structured and semi-structured data describing entities, taxonomies and ontologies appears in many domains. There is a huge interest in integrating structured information from multiple sources; however integrating structured data to infer complex common structures is a difficult task because the integration must aggregate similar structures while avoiding structural inconsistencies that may appear when the data is combined. In this work, we study the integration of structured social metadata: shallow personal hierarchies specified by many individual users on the Social Web, and focus on inferring a collection of integrated, consistent taxonomies. We frame this task as an optimization problem with structural constraints. We propose a new inference algorithm, which we refer to as Relational Affinity Propagation (RAP) that extends affinity propagation(Frey and Dueck, 2007) by introducing structural constraints. We validate the approach on a real-world social media dataset, collected from the photosharing website Flickr. Our empirical results show that our proposed approach is able to construct deeper and denser structures compared to an approach using only the standard affinity propagation algorithm.
Exploiting Causal Independence in Markov Logic Networks: Combining Undirected and Directed Models
Natarajan, Sriraam (University of Wisconsin Madison) | Khot, Tushar (University of Wisconsin Madison) | Lowd, Daniel (University of Oregon) | Tadepalli, Prasad (Oregon State University) | Kersting, Kristian (Fraunhofer IAIS) | Shavlik, Jude (University of Wisconsin-Madison)
A new method is proposed for compiling causal independencies into Markov logic networks. A Markov logic network can be viewed as compactly representing a factorization of a joint probability into the multiplication of a set of factors guided by logical formulas. We present a notion of causal independence that enables one to further factorize the factors into a combination of even smaller factors and consequently obtain a finer-grain factorization of the joint probability. The causal independence lets us specify the factor in terms of weighted, directed clauses and an associative and commutative operator, such as "or", "sum" or "max", on the contribution of the variables involved in the factors, hence combining both undirected and directed knowledge.
Deep Transfer as Structure Learning in Markov Logic Networks
Moore, David Andrew (Williams College) | Danyluk, Andrea Pohoreckyj (Williams College)
Learning the relational structure of a domain is a fundamental problem in statistical relational learning. The deep transfer algorithm of Davis and Domingos attempts to improve structure learning in Markov logic networks by harnessing the power of transfer learning, using the second-order structural regularities of a source domain to bias the structure search process in a target domain. We propose that the clique-scoring process which discovers these second-order regularities constitutes a novel standalone method for learning the structure of Markov logic networks, and that this fact, rather than the transfer of structural knowledge across domains, accounts for much of the performance benefit observed via the deep transfer process. This claim is supported by experiments in which we find that clique scoring within a single domain often produces results equaling or surpassing the performance of deep transfer incorporating external knowledge, and also by explicit algorithmic similarities between deep transfer and other structure learning techniques.
Leveraging Ontologies for Lifted Probabilistic Inference and Learning
Kiddon, Chloe Marielle (University of Washington) | Domingos, Pedro (University of Washington)
Exploiting ontologies for efficient inference is one of the most widely studied topics in knowledge representation and reasoning. The use of ontologies for probabilistic inference, however, is much less developed. A number of algorithms for lifted inference in first-order probabilistic languages have been proposed, but their scalability is limited by the combinatorial explosion in the sets of objects that need to be considered. We propose a coarse-to-fine inference approach that leverages a class hierarchy to combat this problem. Starting at the highest level, our approach performs inference at successively finer grains, pruning low-probability atoms before refining. We provide bounds on the error incurred by this approach relative to full ground inference as a function of the pruning threshold. We also show how to learn parameters in a coarse-to-fine manner to maximize the opportunities for pruning during inference. Experiments on link prediction and biomolecular event prediction tasks show our method can greatly improve the scalability of lifted probabilistic inference.
Online Max-Margin Weight Learning with Markov Logic Networks
Huynh, Tuyen N. (The University of Texas at Austin) | Mooney, Raymond J. (The University of Texas at Austin)
Most of the existing weight-learning algorithms for Markov Logic Networks (MLNs) use batch training which becomes computationally expensive and even infeasible for very large datasets since the training examples may not fit in main memory. To overcome this problem, previous work has used online learning algorithms to learn weights for MLNs. However, this prior work has only applied existing online algorithms, and there is no comprehensive study of online weight learning for MLNs. In this paper, we derive new online algorithms for structured prediction using the primal-dual framework, apply them to learn weights for MLNs, and compare against existing online algorithms on two large, real-world datasets. The experimental results show that the new algorithms achieve better accuracy than existing methods.