AITopics

Large numbers of descriptors and large codebooks are needed for good results and this becomes slow using k-means. We introduce Extremely Randomized Clustering Forests - ensembles of randomly created clustering trees - and show that these provide more accurate results, much faster training and testing and good resistance to background clutter in several state-of-the-art image classification tasks.

artificial intelligence, descriptor, machine learning, (18 more...)

Country: Europe (0.69)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Meeds, Edward, Ghahramani, Zoubin, Neal, Radford M., Roweis, Sam T.

Modeling Dyadic Data with Binary Latent Factors

We introduce binary matrix factorization, a novel model for unsupervised matrix decomposition. The decomposition is learned by fitting a nonparametric Bayesian probabilistic model with binary latent variables to a matrix of dyadic data. Unlike bi-clustering models, which assign each row or column to a single cluster based on a categorical hidden feature, our binary feature model reflects the prior belief that items and attributes can be associated with more than one latent cluster at a time. We provide simple learning and inference rules for this new model and show how to extend it to an infinite model in which the number of features is not a priori fixed but is allowed to grow with the size of the data.
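To make the contrast with bi-clustering concrete, here is a minimal sketch of the kind of decomposition the abstract describes. It assumes the data matrix is approximated by a product U W V^T, where U and V are binary feature matrices for rows and columns and W holds real-valued interaction weights. All names and toy values are hypothetical, and the noise model and the nonparametric prior are omitted.

```python
def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def reconstruct(u, w, v):
    """Noise-free reconstruction U W V^T (assumed model form, not the full method)."""
    return matmul(matmul(u, w), transpose(v))


# Hypothetical toy values. Each row of U or V may switch on several features,
# whereas a bi-clustering model would allow exactly one 1 per row.
U = [[1, 0], [1, 1], [0, 1]]          # 3 row items, 2 binary row features
V = [[1, 0], [0, 1], [1, 1], [0, 0]]  # 4 column items, 2 binary column features
W = [[2.0, -1.0], [0.5, 3.0]]         # weight for each (row feature, column feature) pair

X_mean = reconstruct(U, W, V)
for row in X_mean:
    print(row)
```

An entry of the reconstruction is the sum of the weights for every pair of active row and column features, which is how one item can belong to several latent clusters at once.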

artificial intelligence, bayesian inference, machine learning, (19 more...)

Country: North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report (0.34)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.47)

Isotonic Conditional Random Fields and Local Sentiment Flow

Mao, Yi, Lebanon, Guy

Predicting the document's sentiment would allow matching

machine learning, natural language, sentiment, (17 more...)

Country: North America > United States > Indiana > Tippecanoe County (0.14)

Industry:

Media > Film (0.69)
Leisure & Entertainment (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Lu, Le, Hager, Gregory D.

Dynamic Foreground/Background Extraction from Images and Videos using Random Patches

In this paper, we propose a novel exemplar-based approach to extract dynamic foreground regions from a changing background within a collection of images or a video sequence. By using image segmentation as a pre-processing step, we convert this traditional pixel-wise labeling problem into a lower-dimensional supervised, binary labeling procedure on image segments. Our approach consists of three steps. First, a set of random image patches are spatially and adaptively sampled within each segment. Second, these sets of extracted samples are formed into two "bags of patches" to model the foreground/background appearance, respectively.

artificial intelligence, image patch, machine learning, (18 more...)

Country: North America > United States (0.28)

Industry: Education > Educational Setting (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Listgarten, Jennifer, Neal, Radford M., Roweis, Sam T., Puckrin, Rachel, Cutler, Sean

Bayesian Detection of Infrequent Differences in Sets of Time Series with Shared Structure

We present a hierarchical Bayesian model for sets of related, but different, classes of time series data. Our model performs alignment simultaneously across all classes, while detecting and characterizing class-specific differences. During inference the model produces, for each class, a distribution over a canonical representation of the class. These class-specific canonical representations are automatically aligned to one another -- preserving common substructures, and highlighting differences.

artificial intelligence, machine learning, time series, (17 more...)

Country: North America > Canada > Ontario > Toronto (0.14)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.88)

Lindgren, J.T., Hyvärinen, Aapo

Emergence of conjunctive visual features by quadratic independent component analysis

In this paper we estimate quadratic models for natural images using Independent Component Analysis (ICA). The used quadratic functions are a natural extension to linear functions (i.e.

artificial intelligence, eigenvector, machine learning, (17 more...)

Country: Europe > Finland (0.15)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.95)
Information Technology > Artificial Intelligence > Machine Learning (0.95)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.40)

Learnability and the doubling dimension

Li, Yi, Long, Philip M.

We prove bounds on the sample complexity of PAC learning in terms of the doubling dimension of this metric. These bounds imply known bounds on the sample complexity of learning halfspaces with respect to the uniform distribution that are optimal up to a constant factor.

artificial intelligence, dimension, machine learning, (18 more...)

Country:

Europe (0.14)
Asia (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.72)

Li, Ping, Church, Kenneth W., Hastie, Trevor J.

Conditional Random Sampling: A Sketch-based Sampling Technique for Sparse Data

In large-scale applications, the data are often highly sparse. CRS combines sketching and sampling in that it converts sketches of the data into conditional random samples online in the estimation stage, with the sample size determined retrospectively.
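The sketch-to-sample conversion can be illustrated roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes column IDs are randomly permuted once, each sparse vector keeps only the k nonzero entries with the smallest permuted IDs as its sketch, and at estimation time both sketches are trimmed to the smaller of their largest IDs, which then acts as the retrospectively determined sample size. The function names and the inner-product estimator shown here are hypothetical choices for illustration.

```python
import random


def build_sketch(vector, perm, k):
    """Keep the k nonzero entries with the smallest permuted column IDs.

    vector maps original column index -> value; perm maps original column
    index -> permuted ID in 1..dim.
    """
    entries = sorted((perm[i], value) for i, value in vector.items() if value != 0)
    return entries[:k]


def estimate_inner_product(sketch_a, sketch_b, dim):
    """Estimate the inner product of two vectors from their sketches."""
    if not sketch_a or not sketch_b:
        return 0.0  # a vector with no nonzeros has inner product zero
    # Effective sample size: both sketches are complete up to this permuted ID.
    d_s = min(sketch_a[-1][0], sketch_b[-1][0])
    a = {pid: value for pid, value in sketch_a if pid <= d_s}
    b = {pid: value for pid, value in sketch_b if pid <= d_s}
    sample_dot = sum(value * b[pid] for pid, value in a.items() if pid in b)
    # Scale the sample statistic from d_s sampled columns up to all dim columns.
    return sample_dot * dim / d_s


# Hypothetical demo data: two sparse 0/1 vectors over 1000 columns.
rng = random.Random(0)
dim = 1000
ids = list(range(1, dim + 1))
rng.shuffle(ids)
perm = dict(enumerate(ids))
x = {i: 1.0 for i in rng.sample(range(dim), 100)}
y = {i: 1.0 for i in rng.sample(range(dim), 100)}

estimate = estimate_inner_product(build_sketch(x, perm, 40), build_sketch(y, perm, 40), dim)
exact = sum(value * y.get(i, 0.0) for i, value in x.items())
print(estimate, exact)
```

Because a sketch stores every nonzero up to its largest permuted ID, the trimmed pair behaves like a random sample of the first d_s permuted columns, and d_s differs for each pair of vectors being compared.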

artificial intelligence, machine learning, random projection, (13 more...)

Country: North America > United States > California > Santa Clara County (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Jaeger, T. F., Levy, Roger P.

Speakers optimize information density through syntactic reduction

If language users are rational, they might choose to structure their utterances so as to optimize communicative properties. In particular, information-theoretic and psycholinguistic considerations suggest that this may include maximizing the uniformity of information density in an utterance. We investigate this possibility in the context of syntactic reduction, where the speaker has the option of either marking a higher-order unit (a phrase) with an extra word, or leaving it unmarked. We demonstrate that speakers are more likely to reduce less information-dense phrases. In a second step, we combine a stochastic model of structured utterance production with a logistic-regression model of syntactic reduction to study which types of cues speakers employ when estimating the predictability of upcoming elements. We demonstrate that the trend toward predictability-sensitive syntactic reduction (Jaeger, 2006) is robust in the face of a wide variety of control variables, and present evidence that speakers use both surface and structural cues for predictability estimation.

artificial intelligence, machine learning, natural language, (18 more...)