AITopics

In supervised learning there is a typical presumption that the training and test points are taken from the same distribution. In practice this assumption is commonly violated.The situations where the training and test data are from different distributions is called covariate shift. Recent work has examined techniques for dealing with covariate shift in terms of minimisation of generalisation error. As yet the literature lacks a Bayesian generative perspective on this problem. This paper tackles this issue for regression models. Recent work on covariate shift can be understood in terms of mixture regression. Using this view, we obtain a general approach to regression under covariate shift, which reproduces previous work as a special case. The main advantages of this new formulation over previous models forcovariate shift are that we no longer need to presume the test and training densities are known, the regression and density estimation are combined into a single procedure, and previous methods are reproduced as special cases of this procedure, shedding light on the implicit assumptions the methods are making.

artificial intelligence, covariate shift, machine learning, (17 more...)

Country: Asia (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.92)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.48)

Navarro, Daniel J., Griffiths, Thomas L.

A Nonparametric Bayesian Method for Inferring Features From Similarity Judgments

The additive clustering model is widely used to infer the features of a set of stimuli from their similarities, on the assumption that similarity is a weighted linear function ofcommon features. This paper develops a fully Bayesian formulation of the additive clustering model, using methods from nonparametric Bayesian statistics to allow the number of features to vary. We use this to explore several approaches to parameter estimation, showing that the nonparametric Bayesian approach provides astraightforward way to obtain estimates of both the number of features used in producing similarity judgments and their importance.

artificial intelligence, machine learning, representation, (18 more...)

Country:

Asia (1.00)
North America > United States (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Learnability and the doubling dimension

Li, Yi, Long, Philip M.

We prove bounds on the sample complexity of PAC learning in terms of the doubling dimension of this metric. These bounds imply known bounds on the sample complexity of learning halfspaces with respect to the uniform distribution that are optimal up to a constant factor.

artificial intelligence, dimension, machine learning, (18 more...)

Country:

Europe (0.14)
Asia (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.72)

Kobayashi, Ryota, Shinomoto, Shigeru

Predicting spike times from subthreshold dynamics of a neuron

Though those simulators have been successful in reproducing qualitative aspects of neuronal responses to various conditions, quantitative reproduction as well as prediction for novel experiments appears to be difficult to realize [7]. The difficulty is due to the complexity of the model accompanied with a large number of undetermined free parameters.

artificial intelligence, machine learning, neuron, (16 more...)

Country:

Asia > Japan (0.15)
North America > United States (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.94)

Kawahara, Yoshinobu, Yairi, Takehisa, Machida, Kazuo

A Kernel Subspace Method by Stochastic Realization for Learning Nonlinear Dynamical Systems

We construct the theoretical underpinning and derive a concrete algorithm for nonlinear identification.

artificial intelligence, machine learning, matrix, (14 more...)

Country:

Europe (0.46)
Asia > Japan (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

El-Yaniv, Ran, Nisenson, Mordechai

Optimal Single-Class Classification Strategies

We consider single-class classification (SCC) as a two-person game between the learner and an adversary. In this game the target distribution is completely known to the learner and the learner's goal is to construct a classifier capable of guaranteeing agiven tolerance for the false-positive error while minimizing the false negative error. We identify both "hard" and "soft" optimal classification strategies for different types of games and demonstrate that soft classification can provide a significant advantage. Our optimal strategies and bounds provide worst-case lower bounds for standard, finite-sample SCC and also motivate new approaches to solving SCC.

data mining, learner, machine learning, (20 more...)

Country: Asia > Middle East (0.14)

Industry: Leisure & Entertainment > Games (0.88)

Technology:

Information Technology > Data Science > Data Mining (0.96)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.90)

Cuturi, Marco, Fukumizu, Kenji

Kernels on Structured Objects Through Nested Histograms

We propose a family of kernels for structured objects which is based on the bag-ofcomponents paradigm.However, rather than decomposing each complex object into the single histogram of its components, we use for each object a family of nested histograms, where each histogram in this hierarchy describes the object seen from an increasingly granular perspective. We use this hierarchy of histograms todefine elementary kernels which can detect coarse and fine similarities between the objects. We compute through an efficient averaging trick a mixture of such specific kernels, to propose a final kernel value which weights efficiently local and global matches. We propose experimental results on an image retrieval experiment which show that this mixture is an effective template procedure to be used with kernels on histograms.

artificial intelligence, kernel, machine learning, (17 more...)

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)

Genre: Research Report (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Sensing and Signal Processing > Image Processing (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.85)

Morgan & Claypool PublishersDec-31-2007

A Concise Introduction to Multiagent Systems and Distributed Artificial Intelligence

Vlassis, Nikos

Multiagent systems is an expanding field that blends classical fields like game theory and decentralized control with modern fields like computer science and machine learning. This monograph provides a concise introduction to the subject, covering the theoretical foundations as well as more recent developments in a coherent and readable manner. The text is centered on the concept of an agent as decision maker. Chapter 1 is a short introduction to the field of multiagent systems. Chapter 2 covers the basic theory of singleagent decision making under uncertainty.

artificial intelligence, top description table, university, (9 more...)

Morgan & Claypool Publishers

Country:

Europe > Netherlands (0.24)
Europe > Greece (0.22)
Asia > Japan (0.19)
Europe > Switzerland (0.17)

Genre:

Collection (0.52)
Personal (0.33)

Industry: Leisure & Entertainment > Sports > Soccer (0.35)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Bhattacharya, I., Getoor, L.

Query-time Entity Resolution

Journal of Artificial Intelligence ResearchDec-27-2007

Entity resolution is the problem of reconciling database references corresponding to the same real-world entities. Given the abundance of publicly available databases that have unresolved entities, we motivate the problem of query-time entity resolution quick and accurate resolution for answering queries over such `unclean' databases at query-time. Since collective entity resolution approaches --- where related references are resolved jointly --- have been shown to be more accurate than independent attribute-based resolution for off-line entity resolution, we focus on developing new algorithms for collective resolution for answering entity resolution queries at query-time. For this purpose, we first formally show that, for collective resolution, precision and recall for individual entities follow a geometric progression as neighbors at increasing distances are considered. Unfolding this progression leads naturally to a two stage `expand and resolve' query processing strategy. In this strategy, we first extract the related records for a query using two novel expansion operators, and then resolve the extracted records collectively. We then show how the same strategy can be adapted for query-time entity resolution by identifying and resolving only those database references that are the most helpful for processing the query. We validate our approach on two large real-world publication databases where we show the usefulness of collective resolution and at the same time demonstrate the need for adaptive strategies for query processing. We then show how the same queries can be answered in real-time using our adaptive approach while preserving the gains of collective resolution. In addition to experiments on real datasets, we use synthetically generated data to empirically demonstrate the validity of the performance trends predicted by our analysis of collective entity resolution over a wide range of structural characteristics in the data.

entity resolution, query, resolution, (16 more...)

doi: 10.1613/jair.2290

AI Access Foundation

10524

Country:

North America > United States > Maryland > Prince George's County > College Park (0.14)
North America > United States > District of Columbia > Washington (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(15 more...)

Genre: Research Report (0.67)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.88)

Domshlak, C., Hoffmann, J.

Probabilistic Planning via Heuristic Forward Search and Weighted Model Counting

Journal of Artificial Intelligence ResearchDec-17-2007

We present a new algorithm for probabilistic planning with no observability. Our algorithm, called Probabilistic-FF, extends the heuristic forward-search machinery of Conformant-FF to problems with probabilistic uncertainty about both the initial state and action effects. Specifically, Probabilistic-FF combines Conformant-FF's techniques with a powerful machinery for weighted model counting in (weighted) CNFs, serving to elegantly define both the search space and the heuristic function. Our evaluation of Probabilistic-FF shows its fine scalability in a range of probabilistic domains, constituting a several orders of magnitude improvement over previous results in this area. We use a problematic case to point out the main open issue to be addressed by further research.

conformant-ff, probabilistic-ff, probability, (15 more...)

doi: 10.1613/jair.2289

AI Access Foundation

10523