AITopics

This paper describes a simple and efficient method to make template-based object classification invariant to in-plane rotations. The task is divided into two parts: orientation discrimination and classification. The key idea is to perform the orientation discrimination before the classification. This can be accomplished by hypothesizing, in turn, that the input image belongs to each class of interest. The image can then be rotated to maximize its similarity to the training images in each class (these contain the prototype object in an upright orientation). This process yields a set of images, at least one of which will have the object in an upright position. The resulting images can then be classified by models which have been trained with only upright examples. This approach has been successfully applied to two real-world vision-based tasks: rotated handwritten digit recognition and rotated face detection in cluttered scenes.
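The hypothesize-then-derotate idea in the abstract above can be illustrated with a minimal sketch. It rests on several assumptions that are not from the paper: rotations are limited to multiples of 90 degrees, each class is represented by a single upright template, and plain template similarity stands in for the trained upright-only classifier. All function names (`rotate90`, `similarity`, `derotate_for_class`, `classify`) are hypothetical.

```python
from typing import Dict, List, Tuple

Image = List[List[float]]


def rotate90(img: Image, k: int) -> Image:
    """Rotate a square image clockwise by k * 90 degrees."""
    out = [row[:] for row in img]
    for _ in range(k % 4):
        # Reverse the row order, then transpose: one clockwise quarter turn.
        out = [list(col) for col in zip(*out[::-1])]
    return out


def similarity(a: Image, b: Image) -> float:
    """Negative sum of squared pixel differences (higher means more similar)."""
    return -sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def derotate_for_class(img: Image, template: Image) -> Tuple[Image, int]:
    """Hypothesize that img belongs to the template's class and pick the
    rotation that makes it most similar to the upright template."""
    best_k = max(range(4), key=lambda k: similarity(rotate90(img, k), template))
    return rotate90(img, best_k), best_k


def classify(img: Image, templates: Dict[str, Image]) -> Tuple[str, int]:
    """Derotate under every class hypothesis, then score each candidate.

    The score here is template similarity, an assumed stand-in for a
    classifier trained only on upright examples.
    """
    best = None
    for label, template in templates.items():
        candidate, k = derotate_for_class(img, template)
        score = similarity(candidate, template)
        if best is None or score > best[0]:
            best = (score, label, k)
    return best[1], best[2]


# Two toy upright prototypes on a 3x3 grid.
L_SHAPE = [[1, 0, 0], [1, 0, 0], [1, 1, 1]]
T_SHAPE = [[1, 1, 1], [0, 1, 0], [0, 1, 0]]
TEMPLATES = {"L": L_SHAPE, "T": T_SHAPE}

if __name__ == "__main__":
    rotated_input = rotate90(L_SHAPE, 1)
    print(classify(rotated_input, TEMPLATES))
```

Orientation is resolved once per class hypothesis, so the final scoring step only ever sees candidates that are upright under at least one hypothesis.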

classification network, digit, rotation, (10 more...)

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
(2 more...)

Cornford, Dan, Nabney, Ian T., Williams, Christopher K. I.

Adding Constrained Discontinuities to Gaussian Process Models of Wind Fields

Gaussian Processes provide good prior models for spatial data, but can be too smooth. In many physical situations there are discontinuities along bounding surfaces, for example fronts in near-surface wind fields. We describe a modelling method for such a constrained discontinuity and demonstrate how to infer the model parameters in wind fields with MCMC sampling.

constrained discontinuity, discontinuity, wind field, (15 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New York (0.04)
Europe > United Kingdom > Scotland (0.04)
(2 more...)

Industry: Energy > Renewable > Wind (0.91)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.31)

Probabilistic Modeling for Face Orientation Discrimination: Learning from Labeled and Unlabeled Data

Baluja, Shumeet

This paper presents probabilistic modeling methods to solve the problem of discriminating between five facial orientations with very little labeled data.

dependency, pixel, unlabeled data, (12 more...)

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Rao, Rajesh P. N., Ruderman, Daniel L.

Learning Lie Groups for Invariant Visual Perception

One of the most important problems in visual perception is that of visual invariance: how are objects perceived to be the same despite undergoing transformations such as translations, rotations or scaling? In this paper, we describe a Bayesian method for learning invariances based on Lie group theory. We show that previous approaches based on first-order Taylor series expansions of inputs can be regarded as special cases of the Lie group approach, the latter being capable of handling, in principle, arbitrarily large transformations. Using a matrix-exponential-based generative model of images, we derive an unsupervised algorithm for learning Lie group operators from input data containing infinitesimal transformations.
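The relationship between the first-order Taylor approximation and the matrix exponential can be seen in a small numerical sketch. It assumes a fixed, known generator `G` (the generator of 2-D rotations) acting on points rather than on image pixels. The paper's algorithm learns such operators from data, which is not shown here. The helper names `expm_series` and `first_order` are hypothetical.

```python
import math
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product for small dense matrices."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def identity(n: int) -> Matrix:
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def expm_series(g: Matrix, x: float, terms: int = 30) -> Matrix:
    """Approximate exp(x * g) with a truncated power series sum_n (x g)^n / n!."""
    result = identity(len(g))
    term = identity(len(g))
    xg = [[x * v for v in row] for row in g]
    for n in range(1, terms):
        # term now holds (x g)^n / n!
        term = [[v / n for v in row] for row in matmul(term, xg)]
        result = [[r + t for r, t in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result


def first_order(g: Matrix, x: float) -> Matrix:
    """First-order Taylor approximation: identity + x * g."""
    n = len(g)
    return [[(1.0 if i == j else 0.0) + x * g[i][j] for j in range(n)] for i in range(n)]


# Assumed example operator: the generator of rotations in the plane.
G = [[0.0, -1.0], [1.0, 0.0]]

if __name__ == "__main__":
    for x in (0.01, math.pi / 2):
        print(x, expm_series(G, x), first_order(G, x))
```

For a small transformation parameter the two matrices nearly coincide. For a quarter turn the exponential still yields a proper rotation, while the first-order form does not.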

matrix, transformation, (15 more...)

Country:

Europe > Sweden > Östergötland County > Linköping (0.05)
North America > United States > New York (0.04)
North America > United States > California > San Mateo County > San Mateo (0.04)
(2 more...)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.94)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)

Support Vector Machines Applied to Face Recognition

Phillips, P. Jonathon

Face recognition is different from classical pattern-recognition problems such as character recognition.

algorithm, decision surface, recognition, (15 more...)

Country:

North America > United States > New York (0.04)
North America > United States > Maryland > Montgomery County > Gaithersburg (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision > Face Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

A V1 Model of Pop Out and Asymmetry in Visual Search

Li, Zhaoping

Visual input I_{iθ} persists after onset, and initializes the activity levels g_x(x_{iθ}). The activities are then modified by the contextual influences. Depending on the visual input, the system often settles into an oscillatory state (Gray and Singer, 1989; see the details in Li 1998b). Temporal averages of g_x(x_{iθ}) over several oscillation cycles are used as the model's output. The nature of the computation performed by the model is determined largely by the horizontal connections J and W, which are local (spanning only a few hypercolumns), and translation and rotation invariant (Figure 1B).

asymmetry, orientation, segmentation, (17 more...)

Country:

North America > United States (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Technology:

Information Technology > Information Management > Search (0.66)
Information Technology > Artificial Intelligence > Vision (0.56)

Attentional Modulation of Human Pattern Discrimination Psychophysics Reproduced by a Quantitative Model

Itti, Laurent, Braun, Jochen, Lee, Dale K., Koch, Christof

We previously proposed a quantitative model of early visual processing in primates, based on non-linearly interacting visual filters and statistically efficient decision. We now use this model to interpret the observed modulation of a range of human psychophysical thresholds with and without focal visual attention. Our model, calibrated by an automatic fitting procedure, simultaneously reproduces thresholds for four classical pattern discrimination tasks, performed while attention was engaged by another concurrent task. Our model then predicts that the seemingly complex improvements of certain thresholds, which we observed when attention was fully available for the discrimination tasks, can best be explained by a strengthening of competition among early visual filters. What happens when we voluntarily focus our attention on a restricted part of our visual field? Focal attention is often thought of as a gating mechanism, which selectively allows a certain spatial location and certain types of visual features to reach higher visual processes.

orientation, threshold, visual processing, (11 more...)

Country: North America > United States > California > Los Angeles County > Pasadena (0.04)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.61)

Technology:

Information Technology > Artificial Intelligence > Vision (0.58)
Information Technology > Artificial Intelligence > Machine Learning (0.46)

Ioffe, Sergey, Forsyth, David A.

Learning to Find Pictures of People

Finding articulated objects, like people, in pictures presents a particularly difficult object.

classifier, configuration, learning, (17 more...)

Country: North America > United States > California > Alameda County > Berkeley (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Freeman, William T., Pasztor, Egon C.

Learning to Estimate Scenes from Images

We seek the scene interpretation that best explains image data. For example, we may want to infer the projected velocities (scene) which best explain two consecutive image frames (image).

probability, propagation, scene patch, (14 more...)