Similarity and Discrimination in Classical Conditioning: A Latent Variable Account
Courville, Aaron C., Daw, Nathaniel D., Touretzky, David S.
We propose a probabilistic, generative account of configural learning phenomena in classical conditioning. Configural learning experiments probe how animals discriminate and generalize between patterns of simultaneously presented stimuli (such as tones and lights) that are differentially predictive of reinforcement. Previous models of these issues have been successful more on a phenomenological than an explanatory level: they reproduce experimental findings but, lacking formal foundations, provide scant basis for understanding why animals behave as they do. We present a theory that clarifies seemingly arbitrary aspects of previous models while also capturing a broader set of data.
Log-concavity Results on Gaussian Process Methods for Supervised and Unsupervised Learning
Log-concavity is an important property in the context of optimization, Laplace approximation, and sampling; Bayesian methods based on Gaussian process priors have become quite popular recently for classification, regression, density estimation, and point process intensity estimation. Here we prove that the predictive densities corresponding to each of these applications are log-concave, given any observed data. We also prove that the likelihood is log-concave in the hyperparameters controlling the mean function of the Gaussian prior in the density and point process intensity estimation cases, and in the mean, covariance, and observation noise parameters in the classification and regression cases; this result leads to a useful parameterization of these hyperparameters, indicating a suitably large class of priors for which the corresponding maximum a posteriori problem is log-concave.
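(A note on the central property, in standard notation rather than anything specific to this paper: a density f is log-concave when log f is concave, i.e.

    f(\lambda x + (1 - \lambda) y) \ge f(x)^{\lambda} f(y)^{1 - \lambda}, \qquad \lambda \in [0, 1].

The Gaussian density is the canonical example, since \log \mathcal{N}(x \mid \mu, \Sigma) = -\tfrac{1}{2}(x - \mu)^{\top} \Sigma^{-1} (x - \mu) + \mathrm{const} is a concave quadratic in x. Log-concavity matters computationally because it rules out spurious local maxima in MAP optimization and supports efficient sampling schemes such as adaptive rejection sampling.)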
Fast Rates to Bayes for Kernel Machines
Steinwart, Ingo, Scovel, Clint
We establish learning rates to the Bayes risk for support vector machines (SVMs) with hinge loss. In particular, for SVMs with Gaussian RBF kernels we propose a geometric condition on distributions that can be used to determine the approximation properties of these kernels. Finally, we compare our methods with those of a recent paper by G. Blanchard et al.
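For orientation, the standard objects involved here (textbook definitions, not this paper's specific conditions) are the hinge loss and the Gaussian RBF kernel,

    L(y, f(x)) = \max\{0,\, 1 - y f(x)\}, \qquad y \in \{-1, +1\},
    k_{\gamma}(x, x') = \exp(-\gamma \lVert x - x' \rVert^{2}), \qquad \gamma > 0,

and a learning rate to the Bayes risk bounds the excess risk \mathcal{R}(f_n) - \mathcal{R}^{*}, where \mathcal{R}(f) is the misclassification probability and \mathcal{R}^{*} is its infimum over all measurable functions; "fast" rates are those decaying faster than the generic n^{-1/2}.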
A Method for Inferring Label Sampling Mechanisms in Semi-Supervised Learning
Rosset, Saharon, Zhu, Ji, Zou, Hui, Hastie, Trevor J.
We consider the situation in semi-supervised learning where the "label sampling" mechanism stochastically depends on the true response (as well as potentially on the features). We suggest a method of moments for estimating this stochastic dependence using the unlabeled data. This is potentially useful for two distinct purposes: a. as an input to a supervised learning procedure which can be used to "de-bias" its results using labeled data only, and b.
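As a concrete illustration of the moment-matching idea, consider the simplified setting (our assumption for the sketch, not necessarily the paper's) where the response y is binary and the label sampling probability q_y = P(labeled | y) depends on y alone. Inverse-probability-weighted moments of the labeled sample must then match the corresponding moments over the full sample, giving two equations in (q_0, q_1). The function below is a hypothetical sketch of such an estimator; the helper name and the choice of feature moment g are ours.

import numpy as np
from scipy.optimize import fsolve

def estimate_sampling_probs(X_all, X_lab, y_lab, g):
    """Moment-matching sketch for q_y = P(labeled | y), binary y.

    Uses the identity E[g(x) 1{labeled} / q_y] = E[g(x)]: inverse-probability
    weighted moments of the labeled sample should equal moments of the full
    (labeled + unlabeled) sample, for g0(x) = 1 and a user-chosen g1 = g.
    """
    n_all = len(X_all)
    targets = np.array([1.0, g(X_all).mean()])          # full-sample moments

    def residuals(q):
        q0, q1 = q
        w = np.where(y_lab == 1, 1.0 / q1, 1.0 / q0)    # weights 1 / q_{y_i}
        m0 = w.sum() / n_all                            # weighted mass
        m1 = (w * g(X_lab)).sum() / n_all               # weighted feature moment
        return [m0 - targets[0], m1 - targets[1]]

    return fsolve(residuals, x0=np.array([0.5, 0.5]))   # no box constraints; sketch only

# usage sketch: q0_hat, q1_hat = estimate_sampling_probs(
#     X, X[labeled], y[labeled], g=lambda X: X[:, 0])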
Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation
Sudderth, Erik B., Mandel, Michael I., Freeman, William T., Willsky, Alan S.
We describe a three-dimensional geometric hand model suitable for visual tracking applications. The kinematic constraints implied by the model's joints have a probabilistic structure which is well described by a graphical model. Inference in this model is complicated by the hand's many degrees of freedom, as well as by multimodal likelihoods caused by ambiguous image measurements. We use nonparametric belief propagation (NBP) to develop a tracking algorithm which exploits the graph's structure to control complexity, while avoiding costly discretization. While kinematic constraints naturally have a local structure, self-occlusions created by the imaging process lead to complex interdependencies in color and edge-based likelihood functions. However, we show that local structure may be recovered by introducing binary hidden variables describing the occlusion state of each pixel. We augment the NBP algorithm to infer these occlusion variables in a distributed fashion, and then analytically marginalize over them to produce hand position estimates which properly account for occlusion events. We provide simulations showing that NBP may be used to refine inaccurate model initializations, as well as to track hand motion through extended image sequences.
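The analytic marginalization over binary occlusion variables has a simple per-pixel form. The sketch below is our simplification (independent pixels with known occlusion probabilities; in the paper the occlusion variables are themselves inferred within NBP and the likelihoods combine color and edge cues):

import numpy as np

def marginal_log_likelihood(log_p_vis, log_p_occ, p_occluded):
    """Sum over binary occlusion states, pixel by pixel.

    For pixel u with occlusion probability pi_u, the marginal likelihood is
        p(I_u) = pi_u * p(I_u | occluded) + (1 - pi_u) * p(I_u | visible),
    and the image log-likelihood is the sum of per-pixel log marginals.
    All arguments are arrays over pixels.
    """
    a = np.log(p_occluded) + log_p_occ        # occluded branch, in log domain
    b = np.log1p(-p_occluded) + log_p_vis     # visible branch, in log domain
    m = np.maximum(a, b)                      # stable log-sum-exp of the two
    return np.sum(m + np.log(np.exp(a - m) + np.exp(b - m)))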
Methods Towards Invasive Human Brain Computer Interfaces
Lal, Thomas N., Hinterberger, Thilo, Widman, Guido, Schröder, Michael, Hill, N. J., Rosenstiel, Wolfgang, Elger, Christian E., Birbaumer, Niels, Schölkopf, Bernhard
During the last ten years there has been growing interest in the development of Brain Computer Interfaces (BCIs). The field has mainly been driven by the needs of completely paralyzed patients to communicate. With a few exceptions, most human BCIs are based on extracranial electroencephalography (EEG). However, reported bit rates are still low. One reason for this is the low signal-to-noise ratio of the EEG [16]. We are currently investigating whether BCIs based on electrocorticography (ECoG) are a viable alternative. In this paper we present the method and examples of intracranial EEG recordings of three epilepsy patients with electrode grids placed on the motor cortex. The patients were asked to repeatedly imagine movements of two kinds, e.g., tongue or finger movements. We analyze the classifiability of the data using Support Vector Machines (SVMs) [18, 21] and Recursive Channel Elimination (RCE) [11].
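For readers unfamiliar with RCE: it is a channel-wise analogue of recursive feature elimination, repeatedly training a classifier and discarding the least informative channel. The sketch below is our simplified rendering, scoring each channel by the summed squared weights of a linear SVM; the exact criterion of [11] may differ.

import numpy as np
from sklearn.svm import SVC

def recursive_channel_elimination(X, y, channel_index, n_keep):
    """Greedy channel selection in the spirit of RCE.

    X: (n_trials, n_features) feature matrix; channel_index: length-n_features
    array giving the recording channel of each feature column.
    """
    channel_index = np.asarray(channel_index)
    channels = list(np.unique(channel_index))
    while len(channels) > n_keep:
        keep = np.isin(channel_index, channels)          # features of surviving channels
        clf = SVC(kernel="linear").fit(X[:, keep], y)
        w2 = clf.coef_.ravel() ** 2                      # squared SVM weights
        idx = channel_index[keep]
        scores = {c: w2[idx == c].sum() for c in channels}
        channels.remove(min(scores, key=scores.get))     # drop the weakest channel
    return channels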
Incremental Algorithms for Hierarchical Classification
Cesa-bianchi, Nicolò, Gentile, Claudio, Tironi, Andrea, Zaniboni, Luca
We study the problem of hierarchical classification when labels corresponding to partial and/or multiple paths in the underlying taxonomy are allowed. We introduce a new hierarchical loss function, the H-loss, implementing the simple intuition that additional mistakes in the subtree of a mistaken class should not be charged for. Based on a probabilistic data model introduced in earlier work, we derive the Bayes-optimal classifier for the H-loss. We then empirically compare two incremental approximations of the Bayes-optimal classifier with a flat SVM classifier and with classifiers obtained by using hierarchical versions of the Perceptron and SVM algorithms. The experiments show that our simplest incremental approximation of the Bayes-optimal classifier performs, after just one training epoch, nearly as well as the hierarchical SVM classifier (which performs best). For the same incremental algorithm we also derive an H-loss bound showing, when data are generated by our probabilistic data model, exponentially fast convergence to the H-loss of the hierarchical classifier based on the true model parameters.
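To make the H-loss intuition concrete, here is a minimal sketch of the loss computation as we read the description above: a node is charged only if it is mispredicted while all of its ancestors are predicted correctly, so mistakes inside the subtree of a mistaken node cost nothing extra. Per-node costs are included as an option; the exact cost coefficients used in the paper may differ.

def h_loss(y_true, y_pred, parent, cost=None):
    """H-loss sketch over a taxonomy given as parent pointers.

    y_true, y_pred: dicts mapping node -> 0/1 label; parent: dict mapping
    node -> parent node (None at the roots); cost: optional node -> cost.
    """
    def ancestors_correct(node):
        p = parent[node]
        while p is not None:
            if y_true[p] != y_pred[p]:
                return False                 # a mistaken ancestor absorbs the charge
            p = parent[p]
        return True

    return sum(
        (cost[n] if cost else 1)
        for n in y_true
        if y_true[n] != y_pred[n] and ancestors_correct(n)
    )

# Example: chain r -> a -> b with y_true = {r: 1, a: 1, b: 1} and
# y_pred = {r: 1, a: 0, b: 0} charges only node a; b lies below the mistake.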