Image Processing
Learning Lie Groups for Invariant Visual Perception
Rao, Rajesh P. N., Ruderman, Daniel L.
One of the most important problems in visual perception is that of visual invariance: howare objects perceived to be the same despite undergoing transformations such as translations, rotations or scaling? In this paper, we describe a Bayesian method for learning invariances based on Lie group theory. We show that previous approaches based on first-order Taylor series expansions of inputs can be regarded as special cases of the Lie group approach, the latter being capable ofhandling in principle arbitrarily large transfonnations. Using a matrixexponential basedgenerative model of images, we derive an unsupervised algorithm for learning Lie group operators from input data containing infinitesimal transfonnations.
A Non-Parametric Multi-Scale Statistical Model for Natural Images
Bonet, Jeremy S. De, Viola, Paul A.
The observed distribution of natural images is far from uniform. On the contrary, real images have complex and important structure that can be exploited for image processing, recognition and analysis. There have been many proposed approaches to the principled statistical modeling of images, but each has been limited in either the complexity of the models or the complexity of the images. We present a nonparametric multi-scale statistical model for images that can be used for recognition, image de-noising, and in a "generative mode" to synthesize high quality textures.
Modeling Complex Cells in an Awake Macaque during Natural Image Viewing
Vinje, William E., Gallant, Jack L.
Our model consists of a classical energy mechanism whose output is divided by nonclassical gain control and texture contrast mechanisms. We apply this model to review movies, a stimulus sequence that replicates the stimulation a cell receives during free viewing of natural images. Data were collected from three cells using five different review movies, and the model was fit separately to the data from each movie. For the energy mechanism alone we find modest but significant correlations (rE 0.41, 0.43, 0.59, 0.35) between model and data. These correlations are improved somewhat when we allow for suppressive surround effects (rE G 0.42, 0.56, 0.60, 0.37). In one case the inclusion of a delayed suppressive surround dramatically improves the fit to the data by modifying the time course of the model's response.
A Non-Parametric Multi-Scale Statistical Model for Natural Images
Bonet, Jeremy S. De, Viola, Paul A.
The observed distribution of natural images is far from uniform. On the contrary, real images have complex and important structure thatcan be exploited for image processing, recognition and analysis. There have been many proposed approaches to the principled statisticalmodeling of images, but each has been limited in either the complexity of the models or the complexity of the images. Wepresent a nonparametric multi-scale statistical model for images that can be used for recognition, image de-noising, and in a "generative mode" to synthesize high quality textures.
An Analog VLSI Neural Network for Phase-based Machine Vision
Shi, Bertram Emil, Hui, Kwok Fai
Gabor filters are used as preprocessing stages for different tasks in machine vision and image processing. Their use has been partially motivated by findings that two dimensional Gabor filters can be used to model receptive fields of orientation selective neurons in the visual cortex (Daugman, 1980) and three dimensional spatiotemporal Gabor filters can be used to model biological image motion analysis (Adelson, 1985). A Gabor filter has a complex valued impulse response which is a complex exponential modulated by a Gaussian function.