AITopics

1409.5166

Country:

North America > United States (0.67)
Asia > China > Guangdong Province (0.28)

Genre: Research Report (0.81)

Industry:

Consumer Products & Services > Travel (1.00)
Transportation > Freight & Logistics Services (0.89)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.84)

Bresler, Guy, Gamarnik, David, Shah, Devavrat

Hardness of parameter estimation in graphical models

arXiv.org Artificial IntelligenceSep-17-2014

We consider the problem of learning the canonical parameters specifying an undirected graphical model (Markov random field) from the mean parameters. For graphical models representing a minimal exponential family, the canonical parameters are uniquely determined by the mean parameters, so the problem is feasible in principle. The goal of this paper is to investigate the computational feasibility of this statistical task. Our main result shows that parameter estimation is in general intractable: no algorithm can learn the canonical parameters of a generic pair-wise binary graphical model from the mean parameters in time bounded by a polynomial in the number of variables (unless RP = NP). Indeed, such a result has been believed to be true (see the monograph by Wainwright and Jordan (2008)) but no proof was known. Our proof gives a polynomial time reduction from approximating the partition function of the hard-core model, known to be hard, to learning approximate parameters. Our reduction entails showing that the marginal polytope boundary has an inherent repulsive property, which validates an optimization procedure over the polytope that does not use any knowledge of its structure (as required by the ellipsoid method and others).

artificial intelligence, hardcore model, machine learning, (17 more...)

1409.3836

Country: Asia > Middle East > Jordan (0.24)

Genre: Research Report > New Finding (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)

Ishiguro, Katsuhiko, Sato, Issei, Ueda, Naonori

Collapsed Variational Bayes Inference of Infinite Relational Model

arXiv.org Machine LearningSep-16-2014

The Infinite Relational Model (IRM) is a probabilistic model for relational data clustering that partitions objects into clusters based on observed relationships. This paper presents Averaged CVB (ACVB) solutions for IRM, convergence-guaranteed and practically useful fast Collapsed Variational Bayes (CVB) inferences. We first derive ordinary CVB and CVB0 for IRM based on the lower bound maximization. CVB solutions yield deterministic iterative procedures for inferring IRM given the truncated number of clusters. Our proposal includes CVB0 updates of hyperparameters including the concentration parameter of the Dirichlet Process, which has not been studied in the literature. To make the CVB more practically useful, we further study the CVB inference in two aspects. First, we study the convergence issues and develop a convergence-guaranteed algorithm for any CVB-based inferences called ACVB, which enables automatic convergence detection and frees non-expert practitioners from difficult and costly manual monitoring of inference processes. Second, we present a few techniques for speeding up IRM inferences. In particular, we describe the linear time inference of CVB0, allowing the IRM for larger relational data uses. The ACVB solutions of IRM showed comparable or better performance compared to existing inference methods in experiments, and provide deterministic, faster, and easier convergence detection.

data mining, machine learning, natural language, (20 more...)

1409.4757

Country:

Asia > Japan (0.28)
North America > United States (0.28)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.54)
(2 more...)

Yildiz, Olcay Taner, Alpaydin, Ethem

Multivariate Comparison of Classification Algorithms

arXiv.org Machine LearningSep-16-2014

Statistical tests that compare classification algorithms are univariate and use a single performance measure, e.g., misclassification error, $F$ measure, AUC, and so on. In multivariate tests, comparison is done using multiple measures simultaneously. For example, error is the sum of false positives and false negatives and a univariate test on error cannot make a distinction between these two sources, but a 2-variate test can. Similarly, instead of combining precision and recall in $F$ measure, we can have a 2-variate test on (precision, recall). We use Hotelling's multivariate $T^2$ test for comparing two algorithms, and when we have three or more algorithms we use the multivariate analysis of variance (MANOVA) followed by pairwise post hoc tests. In our experiments, we see that multivariate tests have higher power than univariate tests, that is, they can detect differences that univariate tests cannot. We also discuss how multivariate analysis allows us to automatically extract performance measures that best distinguish the behavior of multiple algorithms.

artificial intelligence, machine learning, multivariate test, (16 more...)

1409.4566

Country: Asia > Middle East > Republic of Türkiye (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Coelho, Francisco, Nogueira, Vitor

Probabilistic Selection in AgentSpeak(L)

arXiv.org Artificial IntelligenceSep-12-2014

Agent programming is mostly a symbolic discipline and, as such, draws little benefits from probabilistic areas as machine learning and graphical models. However, the greatest objective of agent research is the achievement of autonomy in dynamical and complex environments --- a goal that implies embracing uncertainty and therefore the entailed representations, algorithms and techniques. This paper proposes an innovative and conflict free two layer approach to agent programming that uses already established methods and tools from both symbolic and probabilistic artificial intelligence. Moreover, this framework is illustrated by means of a widely used agent programming example, GoldMiners.

artificial intelligence, bayesian inference, machine learning, (15 more...)

doi: 10.1007/978-3-319-25524-8_44

1409.3717

Country:

South America > Brazil > Rio Grande do Sul (0.04)
Europe > Germany > Saarland (0.04)
Asia > India (0.04)

Genre: Research Report (0.40)

Industry: Materials > Metals & Mining (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.50)

Soloveychik, Ilya, Wiesel, Ami

Tyler's Covariance Matrix Estimator in Elliptical Models with Convex Structure

arXiv.org Machine LearningSep-11-2014

Covariance matrix estimation is a fundamental problem in the field of statistical signal processing. Many algorithms for detection and inference rely on accurate covariance estimators [1, 2]. The problem is well understood in the Gaussian unstructured case. But becomes significantly harder when the underlying distribution is non-Gaussian, for example in elliptical distributions, and when there is prior knowledge on the structure. In this paper, we propose a unified framework for covariance estimation in elliptical distributions with general convex structure. Over the last years there was a great interest in covariance estimation with known structure. The motivation to these works is that in many modern applications the dimension of the underlying distribution is large and there are not enough samples to estimate it precisely without additional structure hypotheses. The prior information on the structure reduces the number of degrees of freedom in the model and allows accurate estimation with a small number of samples. This is clearly true when the structure is exact, but also when it is approximate due to the well known bias-variance tradeoff.

artificial intelligence, estimator, machine learning, (16 more...)

1404.1935

Country: Asia > Middle East > Israel (0.47)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.93)

Sheikh, Abdul-Saboor, Shelton, Jacquelyn A., Lücke, Jörg

A Truncated EM Approach for Spike-and-Slab Sparse Coding

arXiv.org Machine LearningSep-3-2014

We study inference and learning based on a sparse coding model with `spike-and-slab' prior. As in standard sparse coding, the model used assumes independent latent sources that linearly combine to generate data points. However, instead of using a standard sparse prior such as a Laplace distribution, we study the application of a more flexible `spike-and-slab' distribution which models the absence or presence of a source's contribution independently of its strength if it contributes. We investigate two approaches to optimize the parameters of spike-and-slab sparse coding: a novel truncated EM approach and, for comparison, an approach based on standard factored variational distributions. The truncated approach can be regarded as a variational approach with truncated posteriors as variational distributions. In applications to source separation we find that both approaches improve the state-of-the-art in a number of standard benchmarks, which argues for the use of `spike-and-slab' priors for the corresponding data domains. Furthermore, we find that the truncated EM approach improves on the standard factored approach in source separation tasks$-$which hints to biases introduced by assuming posterior independence in the factored variational approach. Likewise, on a standard benchmark for image denoising, we find that the truncated EM approach improves on the factored variational approach. While the performance of the factored approach saturates with increasing numbers of hidden dimensions, the performance of the truncated approach improves the state-of-the-art for higher noise levels.

algorithm, experiment, sparse, (16 more...)

1211.3589

Country:

Asia > Middle East > Jordan (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Germany > Lower Saxony > Oldenburg (0.04)
Europe > Germany > Berlin (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.46)

arXiv.org Artificial IntelligenceSep-3-2014

Constructing a Non-Negative Low Rank and Sparse Graph with Data-Adaptive Features

Zhuang, Liansheng, Gao, Shenghua, Tang, Jinhui, Wang, Jingjing, Lin, Zhouchen, Ma, Yi

This paper aims at constructing a good graph for discovering intrinsic data structures in a semi-supervised learning setting. Firstly, we propose to build a non-negative low-rank and sparse (referred to as NNLRS) graph for the given data representation. Specifically, the weights of edges in the graph are obtained by seeking a nonnegative low-rank and sparse matrix that represents each data sample as a linear combination of others. The so-obtained NNLRS-graph can capture both the global mixture of subspaces structure (by the low rankness) and the locally linear structure (by the sparseness) of the data, hence is both generative and discriminative. Secondly, as good features are extremely important for constructing a good graph, we propose to learn the data embedding matrix and construct the graph jointly within one framework, which is termed as NNLRS with embedded features (referred to as NNLRS-EF). Extensive experiments on three publicly available datasets demonstrate that the proposed method outperforms the state-of-the-art graph construction method by a large margin for both semi-supervised classification and discriminative analysis, which verifies the effectiveness of our proposed method.

artificial intelligence, machine learning, representation, (17 more...)

doi: 10.1109/TIP.2015.2441632

1409.0964

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
Asia > China > Shanghai > Shanghai (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Muhammadi, Jafar, Rabiee, Hamid Reza, Hosseini, Abbas

Crowd Labeling: a survey

arXiv.org Artificial IntelligenceSep-3-2014

Recently, there has been a burst in the number of research projects on human computation via crowdsourcing. Multiple choice (or labeling) questions could be referred to as a common type of problem which is solved by this approach. As an application, crowd labeling is applied to find true labels for large machine learning datasets. Since crowds are not necessarily experts, the labels they provide are rather noisy and erroneous. This challenge is usually resolved by collecting multiple labels for each sample, and then aggregating them to estimate the true label. Although the mechanism leads to high-quality labels, it is not actually cost-effective. As a result, efforts are currently made to maximize the accuracy in estimating true labels, while fixing the number of acquired labels. This paper surveys methods to aggregate redundant crowd labels in order to estimate unknown true labels. It presents a unified statistical latent model where the differences among popular methods in the field correspond to different choices for the parameters of the model. Afterwards, algorithms to make inference on these models will be surveyed. Moreover, adaptive methods which iteratively collect labels based on the previously collected labels and estimated models will be discussed. In addition, this paper compares the distinguished methods, and provides guidelines for future work required to address the current open issues.

artificial intelligence, data mining, machine learning, (19 more...)

1301.2774

Country:

North America > United States (1.00)
Europe (0.92)
Asia > Middle East > Iran (0.14)

Genre: Research Report (0.82)

Industry:

Education > Educational Setting (0.46)
Leisure & Entertainment > Games (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
(3 more...)

Journal of Artificial Intelligence ResearchAug-31-2014

Belief Tracking for Planning with Sensing: Width, Complexity and Approximations

Bonet, B., Geffner, H.

We consider the problem of belief tracking in a planning setting where states are valuations over a set of variables that are partially observable, and beliefs stand for the sets of states that are possible. While the problem is intractable in the worst case, it has been recently shown that in deterministic conformant and contingent problems, belief tracking is exponential in a width parameter that is often bounded and small. In this work, we extend these results in two ways. First, we introduce a width notion that applies to non-deterministic problems as well, develop a factored belief tracking algorithm that is exponential in the problem width, and show how it applies to existing benchmarks. Second, we introduce a meaningful, powerful, and sound approximation scheme, beam tracking, that is exponential in a smaller parameter, the problem causal width, and has much broader applicability. We illustrate the value of this algorithm over large instances of problems such as Battleship, Minesweeper, and Wumpus, where it yields state-of-the-art performance in real-time.

artificial intelligence, logic & formal reasoning, machine learning, (19 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4475

AI Access Foundation

10901

Journal of Artificial Intelligence Research

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > Canada > Ontario > Toronto (0.04)
(15 more...)

Industry: Government > Military > Navy (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)