Technology
Sampling Methods for Unsupervised Learning
Fergus, Rob, Zisserman, Andrew, Perona, Pietro
We present an algorithm to overcome the local maxima problem in estimating theparameters of mixture models. It combines existing approaches fromboth EM and a robust fitting algorithm, RANSAC, to give a data-driven stochastic learning scheme. Minimal subsets of data points, sufficient to constrain the parameters of the model, are drawn from proposal densitiesto discover new regions of high likelihood. The proposal densities are learnt using EM and bias the sampling toward promising solutions. The algorithm is computationally efficient, as well as effective at escaping from local maxima. We compare it with alternative methods, including EM and RANSAC, on both challenging synthetic data and the computer vision problem of alpha-matting.
Learning Hyper-Features for Visual Identification
Ferencz, Andras D., Learned-miller, Erik G., Malik, Jitendra
We address the problem of identifying specific instances of a class (cars) from a set of images all belonging to that class. Although we cannot build a model for any particular instance (as we may be provided with only one "training" example of it), we can use information extracted from observing othermembers of the class. We pose this task as a learning problem, in which the learner is given image pairs, labeled as matching or not, and must discover which image features are most consistent for matching instances anddiscriminative for mismatches. We explore a patch based representation, where we model the distributions of similarity measurements definedon the patches. Finally, we describe an algorithm that selects the most salient patches based on a mutual information criterion. This algorithm performs identification well for our challenging dataset of car images, after matching only a few, well chosen patches.
A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees
Farias, Daniela D., Roy, Benjamin V.
We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of pre-selected basis functions. The algorithm carries out a form of cost shaping and minimizes a version of Bellman error. We establish an error bound that scales gracefully with the number of states without imposing the (strong) Lyapunov condition required by its counterpart in[6]. We propose a path-following method that automates selection of important algorithm parameters which represent counterparts tothe "state-relevance weights" studied in [6].
Making Latin Manuscripts Searchable using gHMM's
Edwards, Jaety, Teh, Yee W., Bock, Roger, Maire, Michael, Vesom, Grace, Forsyth, David A.
We describe a method that can make a scanned, handwritten mediaeval latin manuscript accessible to full text search. A generalized HMM is fitted, using transcribed latin to obtain a transition model and one example eachof 22 letters to obtain an emission model. We show results for unigram, bigram and trigram models.
Sparse Coding of Natural Images Using an Overcomplete Set of Limited Capacity Units
Doi, Eizaburo, Lewicki, Michael S.
It has been suggested that the primary goal of the sensory system is to represent input in such a way as to reduce the high degree of redundancy. Givena noisy neural representation, however, solely reducing redundancy is not desirable, since redundancy is the only clue to reduce the effects of noise. Here we propose a model that best balances redundancy reductionand redundant representation. Like previous models, our model accounts for the localized and oriented structure of simple cells, but it also predicts a different organization for the population. With noisy, limited-capacity units, the optimal representation becomes an overcomplete, multi-scalerepresentation, which, compared to previous models, is in closer agreement with physiological data. These results offer a new perspective on the expansion of the number of neurons from retina to V1 and provide a theoretical model of incorporating useful redundancy into efficient neural representations.
Pictorial Structures for Molecular Modeling: Interpreting Density Maps
Dimaio, Frank, Phillips, George, Shavlik, Jude W.
X-ray crystallography is currently the most common way protein structures are elucidated. One of the most time-consuming steps in the crystallographic process is interpretation of the electron density map, a task that involves finding patterns in a three-dimensional picture of a protein. This paper describes DEFT (DEFormable Template), an algorithm using pictorial structures to build a flexible protein model from the protein's amino-acid sequence. Matching this pictorial structure into the density map is a way of automating density-map interpretation. Also described are several extensions to the pictorial structure matching algorithm necessary for this automated interpretation. DEFT is tested on a set of density maps ranging from 2 to 4Å resolution, producing rootmean-squared errorsranging from 1.38 to 1.84Å.
Bayesian inference in spiking neurons
We propose a new interpretation of spiking neurons as Bayesian integrators accumulatingevidence over time about events in the external world or the body, and communicating to other neurons their certainties about these events. In this model, spikes signal the occurrence of new information, i.e.what cannot be predicted from the past activity. As a result, firing statistics are close to Poisson, albeit providing a deterministic representation ofprobabilities. We proceed to develop a theory of Bayesian inference in spiking neural networks, recurrent interactions implementing avariant of belief propagation. Many perceptual and motor tasks performed by the central nervous system are probabilistic, andcan be described in a Bayesian framework [4, 3].
Semigroup Kernels on Finite Sets
Cuturi, Marco, Vert, Jean-philippe
Complex objects can often be conveniently represented by finite sets of simpler components, such as images by sets of patches or texts by bags of words. We study the class of positive definite (p.d.) kernels for two such objects that can be expressed as a function of the merger of their respective sets of components. We prove a general integral representation ofsuch kernels and present two particular examples. One of them leads to a kernel for sets of points living in a space endowed itself with a positive definite kernel. We provide experimental results on a benchmark experiment of handwritten digits image classification which illustrate the validity of the approach.