AITopics

1206.657

Country: Asia > China (0.28)

Genre: Research Report (0.40)

Industry: Health & Medicine (0.31)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.34)

Riegler, Erwin, Kirkelund, Gunvor Elisabeth, Manchón, Carles Navarro, Badiu, Mihai-Alin, Fleury, Bernard Henry

Merging Belief Propagation and the Mean Field Approximation: A Free Energy Approach

arXiv.org Machine LearningJun-28-2012

We present a joint message passing approach that combines belief propagation and the mean field approximation. Our analysis is based on the region-based free energy approximation method proposed by Yedidia et al. We show that the message passing fixed-point equations obtained with this combination correspond to stationary points of a constrained region-based free energy approximation. Moreover, we present a convergent implementation of these message passing fixedpoint equations provided that the underlying factor graph fulfills certain technical conditions. In addition, we show how to include hard constraints in the part of the factor graph corresponding to belief propagation. Finally, we demonstrate an application of our method to iterative channel estimation and decoding in an orthogonal frequency division multiplexing (OFDM) system.

approximation, artificial intelligence, belief revision, (16 more...)

1112.0467

Country:

Europe > France (0.28)
Europe > Denmark (0.14)
Europe > Austria (0.14)
(11 more...)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (0.90)

Hazan, Tamir, Jaakkola, Tommi

On the Partition Function and Random Maximum A-Posteriori Perturbations

In this paper we relate the partition function to the max-statistics of random variables. In particular, we provide a novel framework for approximating and bounding the partition function using MAP inference on randomly perturbed models. As a result, we can use efficient MAP solvers such as graph-cuts to evaluate the corresponding partition function. We show that our method excels in the typical "high signal - high coupling" regime that results in ragged energy landscapes difficult for alternative approaches.

partition function, perturbation, random map perturbation, (13 more...)

1206.641

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(3 more...)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)

arXiv.org Artificial IntelligenceJun-27-2012

Infinite Hidden Relational Models

Xu, Zhao, Tresp, Volker, Yu, Kai, Kriegel, Hans-Peter

In many cases it makes sense to model a relationship symmetrically, not implying any particular directionality. Consider the classical example of a recommendation system where the rating of an item by a user should symmetrically be dependent on the attributes of both the user and the item. The attributes of the (known) relationships are also relevant for predicting attributes of entities and for predicting attributes of new relations. In recommendation systems, the exploitation of relational attributes is often referred to as collaborative filtering. Again, in many applications one might prefer to model the collaborative effect in a symmetrical way. In this paper we present a relational model, which is completely symmetrical. The key innovation is that we introduce for each entity (or object) an infinite-dimensional latent variable as part of a Dirichlet process (DP) model. We discuss inference in the model, which is based on a DP Gibbs sampler, i.e., the Chinese restaurant process. We extend the Chinese restaurant process to be applicable to relational modeling. Our approach is evaluated in three applications. One is a recommendation system based on the MovieLens data set. The second application concerns the prediction of the function of yeast genes/proteins on the data set of KDD Cup 2001 using a multi-relational model. The third application involves a relational medical domain. The experimental results show that our model gives significantly improved estimates of attributes describing relationships or entities in complex relational models.

latent variable, procedure, relational model, (14 more...)

1206.6864

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
North America > United States > California > Santa Clara County > Stanford (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Health & Medicine (1.00)

Kiraly, Franz, Tomioka, Ryota

A Combinatorial Algebraic Approach for the Identifiability of Low-Rank Matrix Completion

In this paper, we review the problem of matrix completion and expose its intimate relations with algebraic geometry, combinatorics and graph theory. We present the first necessary and sufficient combinatorial conditions for matrices of arbitrary rank to be identifiable from a set of matrix entries, yielding theoretical constraints and new algorithms for the problem of matrix completion. We conclude by algorithmically evaluating the tightness of the given conditions and algorithms for practically relevant matrix sizes, showing that the algebraic-combinatorial approach can lead to improvements over stateof-the-art matrix completion methods.

artificial intelligence, machine learning, matrix completion, (16 more...)

1206.647

Country:

Europe (0.46)
Asia > Japan (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.48)

Sato, Issei, Nakagawa, Hiroshi

Rethinking Collapsed Variational Bayes Inference for LDA

We propose a novel interpretation of the collapsed variational Bayes inference with a zero-order Taylor expansion approximation, called CVB0 inference, for latent Dirichlet allocation (LDA). We clarify the properties of the CVB0 inference by using the alpha-divergence. We show that the CVB0 inference is composed of two different divergence projections: alpha=1 and -1. This interpretation will help shed light on CVB0 works.

artificial intelligence, machine learning, natural language, (15 more...)

1206.6435

Country: Asia (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.49)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.46)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.35)

Chen, Shang-Tse, Lin, Hsuan-Tien, Lu, Chi-Jen

An Online Boosting Algorithm with Theoretical Justifications

We study the task of online boosting -- combining online weak learners into an online strong learner. While batch boosting has a sound theoretical foundation, online boosting deserves more study from the theoretical perspective. In this paper, we carefully compare the differences between online and batch boosting, and propose a novel and reasonable assumption for the online weak learner. Based on the assumption, we design an online boosting algorithm with a strong theoretical guarantee by adapting from the offline SmoothBoost algorithm that matches the assumption closely. We further tackle the task of deciding the number of weak learners using established theoretical results for online convex programming and predicting with expert advice. Experiments on real-world data sets demonstrate that the proposed algorithm compares favorably with existing online boosting algorithms.

artificial intelligence, machine learning, weak learner, (17 more...)

1206.6422

Country:

Asia > Taiwan (0.28)
Europe > United Kingdom (0.28)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Learning Markov Network Structure using Brownian Distance Covariance

Khoshgnauz, Ehsan

Undirected graphical models, also known as Markov random fields or Markov networks, have become a part of the mainstream of statistical theory and application in recent years. These models use graphs to represent conditional independences among sets of random variables. In these graphs, the absence of an edge between two vertices means the corresponding random variables are conditionally independent, given the other variables. Learning the structure of a graph is equivalent to learning if there exists an edge between every pair of nodes in the graph. In the past decade, significant progress has been made on designing efficient algorithms to learn undirected graphs from high-dimensional observational datasets. Most of these methods are based on either the penalized maximum-likelihood estimation or penalized regression methods. Works has focused on the problem of estimating the graph in this high dimensional setting, which becomes feasible if graph is sparse.

artificial intelligence, machine learning, matrix, (15 more...)

1206.6361

Country: Asia > Middle East > Iran (0.15)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Kuroki, Manabu, Cai, Zhihong

Stratified Analysis of `Probabilities of Causation'

arXiv.org Artificial IntelligenceJun-27-2012

This paper derives new bounds for the probabilities of causation defined by Pearl (2000), namely, the probability that one observed event was a necessary (or sufficient, or both) cause of another. Tian and Pearl (2000a, 2000b) showed how to bound these probabilities using information from experimental and observational studies,with minimal assumptions about the data-generating process. We derive narrower bounds using covariates measurements that might be available in the studies. In addition, we provide identifiable case under no-prevention assumption and discuss the covariate selection problem from the viewpoint of estimation accuracy. These results provides more accurate information for public policy, legal determination of responsibility and personal decision making.

artificial intelligence, causation, probability, (14 more...)

1206.6861

Country:

North America (0.29)
Asia > Japan > Honshū > Kansai (0.14)

Genre:

Research Report > Strength Medium (0.67)
Research Report > Observational Study (0.67)
Research Report > Experimental Study (0.47)

Industry:

Health & Medicine > Epidemiology (0.95)
Law (0.86)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Degris, Thomas, Sigaud, Olivier, Wuillemin, Pierre-Henri

Chi-square Tests Driven Method for Learning the Structure of Factored MDPs

arXiv.org Artificial IntelligenceJun-27-2012

SDYNA is a general framework designed to address large stochastic reinforcement learning problems. Unlike previous model based methods in FMDPs, it incrementally learns the structure and the parameters of a RL problem using supervised learning techniques. Then, it integrates decision-theoric planning algorithms based on FMDPs to compute its policy. SPITI is an instanciation of SDYNA that exploits ITI, an incremental decision tree algorithm, to learn the reward function and the Dynamic Bayesian Networks with local structures representing the transition function of the problem. These representations are used by an incremental version of the Structured Value Iteration algorithm. In order to learn the structure, SPITI uses Chi-Square tests to detect the independence between two probability distributions. Thus, we study the relation between the threshold used in the Chi-Square test, the size of the model built and the relative error of the value function of the induced policy with respect to the optimal value. We show that, on stochastic problems, one can tune the threshold so as to generate both a compact model and an efficient policy. Then, we show that SPITI, while keeping its model compact, uses the generalization property of its learning method to perform better than a stochastic classical tabular algorithm in large RL problem with an unknown structure. We also introduce a new measure based on Chi-Square to qualify the accuracy of the model learned by SPITI. We qualitatively show that the generalization property in SPITI within the FMDP framework may prevent an exponential growth of the time required to learn the structure of large stochastic RL problems.

artificial intelligence, inductive learning, machine learning, (20 more...)

1206.6842

Country: Asia (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)