Directed Networks
An Expectation-Maximization Algorithm to Compute a Stochastic Factorization From Data
Barreto, Andre M. S. (National Laboratory for Scientific Computing (LNCC)) | Beirigo, Rafael L. (National Laboratory for Scientific Computing (LNCC)) | Pineau, Joelle (McGill University) | Precup, Doina (McGill University)
When a transition probability matrix is represented as the product of two stochastic matrices, swapping the factors of the multiplication yields another transition matrix that retains some fundamental characteristics of the original. Since the new matrix can be much smaller than its precursor, replacing the former for the latter can lead to significant savings in terms of computational effort. This strategy, dubbed the "stochastic-factorization trick," can be used to compute the stationary distribution of a Markov chain, to determine the fundamental matrix of an absorbing chain, and to compute a decision policy via dynamic programming or reinforcement learning. In this paper we show that the stochastic-factorization trick can also provide benefits in terms of the number of samples needed to estimate a transition matrix. We introduce a probabilistic interpretation of a stochastic factorization and build on the resulting model to develop an algorithm to compute the factorization directly from data. If the transition matrix can be well approximated by a low-order stochastic factorization, estimating its factors instead of the original matrix reduces significantly the number of parameters to be estimated. Thus, when compared to estimating the transition matrix directly via maximum likelihood, the proposed method is able to compute approximations of roughly the same quality using less data. We illustrate the effectiveness of the proposed algorithm by using it to help a reinforcement learning agent learn how to play the game of blackjack.
Tractable Learning for Structured Probability Spaces: A Case Study in Learning Preference Distributions
Choi, Arthur (University of California, Los Angeles) | Broeck, Guy Van den (University of California, Los Angeles) | Darwiche, Adnan (University of California, Los Angeles)
Probabilistic sentential decision diagrams (PSDDs) are a tractable representation of structured probability spaces, which are characterized by complex logical constraints on what constitutes a possible world. We develop general-purpose techniques for probabilistic reasoning and learning with PSDDs, allowing one to compute the probabilities of arbitrary logical formulas and to learn PSDDs from incomplete data. We illustrate the effectiveness of these techniques in the context of learning preference distributions, to which considerable work has been devoted in the past. We show, analytically and empirically, that our proposed framework is general enough to support diverse and complex data and query types. In particular, we show that it can learn maximum-likelihood models from partial rankings, pairwise preferences, and arbitrary preference constraints. Moreover, we show that it can efficiently answer many queries exactly, from expected and most likely rankings, to the probability of pairwise preferences, and diversified recommendations. This case study illustrates the effectiveness and flexibility of the developed PSDD framework as a domain-independent tool for learning and reasoning with structured probability spaces.
Probabilistic Inference in Hybrid Domains by Weighted Model Integration
Belle, Vaishak (KU Leuven) | Passerini, Andrea (University of Trento) | Broeck, Guy Van den (KU Leuven)
Weighted model counting (WMC) on a propositional knowledge base is an effective and general approach to probabilistic inference in a variety of formalisms, including Bayesian and Markov Networks. However, an inherent limitation of WMC is that it only admits the inference of discrete probability distributions. In this paper, we introduce a strict generalization of WMC called weighted model integration that is based on annotating Boolean and arithmetic constraints, and combinations thereof. This methodology is shown to capture discrete, continuous and hybrid Markov networks. We then consider the task of parameter learning for a fragment of the language. An empirical evaluation demonstrates the applicability and promise of the proposal.
Secure Routing in Wireless Sensor Networks via POMDPs
Irissappane, Athirai A. (Nanyang Technological University) | Zhang, Jie (Nanyang Technological University) | Oliehoek, Frans A. (University of Liverpool, University of Amsterdam) | Dutta, Partha S. (Rio Tinto)
Trust schemes can identify such nodes, as they Wireless sensor networks are being increasingly can predict a node's behavior (quality) both directly, via evaluation used for sustainable development. The task of routing based on its past actions, and indirectly, using recommendations in these resource-constraint networks is particularly (opinions) from other nodes. However, many challenging as they operate over prolonged trust schemes cannot effectively handle attacks targeting trust deployment periods, necessitating optimal use of systems themselves [Sun et al., 2006] i.e., they are heavily their resources. Moreover, due to the deployment affected by malicious nodes deliberately providing misleading in unattended environments, they become an easy opinions (unfair ratings) about other nodes.
A Personalised Thermal Comfort Model Using a Bayesian Network
Auffenberg, Frederik (University of Southampton) | Stein, Sebastian (University of Southampton) | Rogers, Alex (University of Southampton)
In this paper, we address the challenge of predicting optimal comfort temperatures of individual users of a smart heating system. At present, such systems use simple models of user comfort when deciding on a set point temperature. These models generally fail to adapt to an individual userโs preferences, resulting in poor estimates of a userโs preferred temperature. To address this issue, we propose a personalised thermal comfort model that uses a Bayesian network to learn and adapt to a userโs individual preferences. Through an empirical evaluation based on the ASHRAE RP-884 data set, we show that our model is consistently 17.5- 23.5% more accurate than current models, regardless of environmental conditions and the type of heating system used. Our model is not limited to a single metric but can also infer information about expected user feedback, optimal comfort temperature and thermal sensitivity at the same time, which can be used to reduce energy used for heating with minimal comfort loss.
Learning Geographical Hierarchy Features for Social Image Location Prediction
Zhang, Xiaoming (Beihang University) | Hu, Xia (Texas A and M University) | Li, Zhoujun (Beihang University)
Image location prediction is to estimate the geolocation where an image is taken. Social image contains heterogeneous contents, which makes image location prediction nontrivial. Moreover, it is observed that image content patterns and location preferences correlate hierarchically. Traditional image location prediction methods mainly adopt a single-level architecture, which is not directly adaptable to the hierarchical correlation. In this paper, we propose a geographically hierarchical bi-modal deep belief network model (GH-BDBN), which is a compositional learning architecture that integrates multi-modal deep learning model with non-parametric hierarchical prior model. GH-BDBN learns a joint representation capturing the correlations among different types of image content using a bi-modal DBN, with a geographically hierarchical prior over the joint representation to model the hierarchical correlation between image content and location. Experimental results demonstrate the superiority of our model for image location prediction.
A Scalable Community Detection Algorithm for Large Graphs Using Stochastic Block Models
Peng, Chengbin (King Abdullah University of Science and Technology) | Zhang, Zhihua (Shanghai Jiao Tong University) | Wong, Ka-Chun (City University of Hong Kong) | Zhang, Xiangliang (King Abdullah University of Science and Technology) | Keyes, David (King Abdullah University of Science and Technology)
Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of ''big data,'' traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high quality results on both benchmark and real-world graphs. An example of finding more meaningful communities is illustrated consequently in comparison with a popular modularity maximization algorithm.
Toward Estimating Others' Transition Models Under Occlusion for Multi-Robot IRL
Bogert, Kenneth (University of Georgia) | Doshi, Prashant (University of Georgia)
Multi-robot inverse reinforcement learning (mIRL) is broadly useful for learning, from observations, the behaviors of multiple robots executing fixed trajectories and interacting with each other. In this paper, we relax a crucial assumption in IRL to make it better suited for wider robotic applications: we allow the transition functions of other robots to be stochastic and do not assume that the transition error probabilities are known to the learner. Challenged by occlusion where large portions of others' state spaces are fully hidden, we present a new approach that maps stochastic transitions to distributions over features. Then, the underconstrained problem is solved using nonlinear optimization that maximizes entropy to learn the transition function of each robot from occluded observations. Our methods represent significant and first steps toward making mIRL pragmatic.
Inducing Probabilistic Relational Rules from Probabilistic Examples
Raedt, Luc De (KU Leuven) | Dries, Anton (KU Leuven) | Thon, Ingo (KU Leuven) | Broeck, Guy Van den (KU Leuven) | Verbeke, Mathias (KU Leuven)
We study the problem of inducing logic programs in a probabilistic setting, in which both the example descriptions and their classification can be probabilistic. The setting is incorporated in the probabilistic rule learner ProbFOIL+, which combines principles of the rule learner FOIL with ProbLog, a probabilistic Prolog. We illustrate the approach by applying it to the knowledge base of NELL, the Never-Ending Language Learner.
Sparse Probabilistic Matrix Factorization by Laplace Distribution for Collaborative Filtering
Jing, Liping (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University) | Wang, Peng (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University) | Yang, Liu (Beijing Key Lab of Traffic Data Analysis and Mining and Beijing Jiaotong University)
In recommendation systems, probabilistic matrix factorization (PMF) is a state-of-the-art collaborative filtering method by determining the latent features to represent users and items. However, two major issues limiting the usefulness of PMF are the sparsity problem and long-tail distribution. Sparsity refers to the situation that the observed rating data are sparse, which results in that only part of latent features are informative for describing each item/user. Long tail distribution implies that a large fraction of items have few ratings. In this work, we propose a sparse probabilistic matrix factorization method (SPMF) by utilizing a Laplacian distribution to model the item/user factor vector. Laplacian distribution has ability to generate sparse coding, which is beneficial for SPMF to distinguish the relevant and irrelevant latent features with respect to each item/user. Meanwhile, the tails in Laplacian distribution are comparatively heavy, which is rewarding for SPMF to recommend the tail items. Furthermore, a distributed Gibbs sampling algorithm is developed to efficiently train the proposed sparse probabilistic model. A series of experiments on Netfilix and Movielens datasets have been conducted to demonstrate that SPMF outperforms the existing PMF and its extended version Bayesian PMF (BPMF), especially for the recommendation of tail items.