AITopics

doi: 10.1109/ROBIO.2011.6181516

1107.0399

Country:

Europe > Russia (0.14)
Asia > Middle East (0.14)

Technology:

Information Technology > Sensing and Signal Processing (0.92)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.34)

Girón, Edwin, Frery, Alejandro C., Cribari-Neto, Francisco

Nonparametric Edge Detection in Speckled Imagery

arXiv.org Machine LearningJul-8-2012

We address the issue of edge detection in Synthetic Aperture Radar imagery. In particular, we propose nonparametric methods for edge detection, and numerically compare them to an alternative method that has been recently proposed in the literature. Our results show that some of the proposed methods display superior results and are computationally simpler than the existing method. An application to real (not simulated) data is presented and discussed.

artificial intelligence, edge detection, machine learning, (18 more...)

doi: 10.1016/j.matcom.2012.04.013

1207.1915

Country: South America > Brazil > Pernambuco (0.14)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Vision (0.84)

Tang, Yichuan, Salakhutdinov, Ruslan, Hinton, Geoffrey

Deep Lambertian Networks

arXiv.org Machine LearningJun-27-2012

Visual perception is a challenging problem in part due to illumination variations. A possible solution is to first estimate an illumination invariant representation before using it for recognition. The object albedo and surface normals are examples of such representations. In this paper, we introduce a multilayer generative model where the latent variables include the albedo, surface normals, and the light source. Combining Deep Belief Nets with the Lambertian reflectance assumption, our model can learn good priors over the albedo from 2D images. Illumination variations can be explained by changing only the lighting latent variable in our model. By transferring learned knowledge from similar objects, albedo and surface normals estimation from a single image is possible in our model. Experiments demonstrate that our model is able to generalize as well as improve over standard baselines in one-shot face recognition.

deep learning, neural network, surface normal, (18 more...)

1206.6445

Country: North America > Canada > Ontario > Toronto (0.15)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Katyal, Vini, Aviral, null

Leaf vein segmentation using Odd Gabor filters and morphological operations

arXiv.org Artificial IntelligenceJun-22-2012

Leaf vein forms the basis of leaf characterization and classification. Different species have different leaf vein patterns. It is seen that leaf vein segmentation will help in maintaining a record of all the leaves according to their specific pattern of veins thus provide an effective way to retrieve and store information regarding various plant species in database as well as provide an effective means to characterize plants on the basis of leaf vein structure which is unique for every species. The algorithm proposes a new way of segmentation of leaf veins with the use of Odd Gabor filters and the use of morphological operations for producing a better output. The Odd Gabor filter gives an efficient output and is robust and scalable as compared with the existing techniques as it detects the fine fiber like veins present in leaves much more efficiently.

artificial intelligence, gabor filter, odd gabor filter, (16 more...)

1206.5157

Country:

Asia > India (0.17)
Asia > Taiwan (0.15)

Industry: Media (0.30)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (0.71)

arXiv.org Machine LearningJun-18-2012

On multi-view feature learning

Memisevic, Roland

Sparse coding is a common approach to learning local features for object recognition. Recently, there has been an increasing interest in learning features from spatio-temporal, binocular, or other multi-observation data, where the goal is to encode the relationship between images rather than the content of a single image. We provide an analysis of multi-view feature learning, which shows that hidden variables encode transformations by detecting rotation angles in the eigenspaces shared among multiple image warps. Our analysis helps explain recent experimental results showing that transformation-specific features emerge when training complex cell models on videos. Our analysis also shows that transformation-invariant features can emerge as a by-product of learning representations of transformations.

artificial intelligence, neural network, transformation, (19 more...)

1206.4609

Country:

Europe > Germany (0.28)
Europe > United Kingdom > Scotland (0.14)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Damianou, Andreas, Ek, Carl, Titsias, Michalis, Lawrence, Neil

Manifold Relevance Determination

arXiv.org Machine LearningJun-18-2012

In this paper we present a fully Bayesian latent variable model which exploits conditional nonlinear(in)-dependence structures to learn an efficient latent representation. The latent space is factorized to represent shared and private information from multiple views of the data. In contrast to previous approaches, we introduce a relaxation to the discrete segmentation and allow for a "softly" shared latent space. Further, Bayesian techniques allow us to automatically estimate the dimensionality of the latent spaces. The model is capable of capturing structure underlying extremely high dimensional spaces. This is illustrated by modelling unprocessed images with tenths of thousands of pixels. This also allows us to directly generate novel images from the trained model by sampling from the discovered latent spaces. We also demonstrate the model by prediction of human pose in an ambiguous setting. Our Bayesian framework allows us to perform disambiguation in a principled manner by including latent space priors which incorporate the dynamic nature of the data.

artificial intelligence, bayesian inference, latent space, (17 more...)

1206.461

Country: Europe > United Kingdom > Scotland (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models (0.70)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.34)

Kumar, M. Pawan, Packer, Ben, Koller, Daphne

Modeling Latent Variable Uncertainty for Loss-based Learning

arXiv.org Artificial IntelligenceJun-18-2012

We consider the problem of parameter estimation using weakly supervised datasets, where a training sample consists of the input and a partially specified annotation, which we refer to as the output. The missing information in the annotation is modeled using latent variables. Previous methods overburden a single distribution with two separate tasks: (i) modeling the uncertainty in the latent variables during training; and (ii) making accurate predictions for the output and the latent variables during testing. We propose a novel framework that separates the demands of the two tasks using two distributions: (i) a conditional distribution to model the uncertainty of the latent variables for a given input-output pair; and (ii) a delta distribution to predict the output and the latent variables for a given input. During learning, we encourage agreement between the two distributions by minimizing a loss-based dissimilarity coefficient. Our approach generalizes latent SVM in two important ways: (i) it models the uncertainty over latent variables instead of relying on a pointwise estimate; and (ii) it allows the use of loss functions that depend on latent variables, which greatly increases its applicability. We demonstrate the efficacy of our approach on two challenging problems---object detection and action detection---using publicly available datasets.

artificial intelligence, latent variable, machine learning, (16 more...)

1206.4636

Country: North America > United States > California > Santa Clara County (0.14)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)

Grafsgaard, Joseph F. (North Carolina State University) | Boyer, Kristy Elizabeth (North Carolina State University) | Wiebe, Eric N. (North Carolina State University) | Lester, James C. (North Carolina State University)

Analyzing Posture and Affect in Task-Oriented Tutoring

AAAI ConferencesMay-20-2012

Intelligent tutoring systems research aims to produce systems that meet or exceed the effectiveness of one-on-one expert human tutoring. Theory and empirical study suggest that affective states of the learner must be addressed to achieve this goal. While many affective measures can be utilized, posture offers the advantages of non-intrusiveness and ease of interpretation. This paper presents an accurate posture estimation algorithm applied to a computer-mediated tutoring corpus of depth recordings. Analyses of posture and session-level student reports of engagement and cognitive load identified significant patterns. The results indicate that disengagement and frustration may coincide with closer postural positions and more movement, while focused attention and less frustration occur with more distant, stable postural positions. It is hoped that this work will lead to intelligent tutoring systems that recognize a greater breadth of affective expression through channels of posture and gesture.

computer based training, educational technology, student, (17 more...)

AAAI Conferences

Twenty-Fifth International FLAIRS Conference

Country: North America > United States > North Carolina (0.15)

Genre:

Research Report > Experimental Study (0.48)
Research Report > New Finding (0.46)

Industry: Education > Educational Technology > Educational Software > Computer Based Training (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces (0.56)
Information Technology > Artificial Intelligence > Machine Learning (0.47)
Information Technology > Artificial Intelligence > Vision (0.47)

arXiv.org Machine LearningMay-1-2012

Hybrid Linear Modeling via Local Best-fit Flats

Zhang, Teng, Szlam, Arthur, Wang, Yi, Lerman, Gilad

We present a simple and fast geometric method for modeling data by a union of affine subspaces. The method begins by forming a collection of local best-fit affine subspaces, i.e., subspaces approximating the data in local neighborhoods. The correct sizes of the local neighborhoods are determined automatically by the Jones' $\beta_2$ numbers (we prove under certain geometric conditions that our method finds the optimal local neighborhoods). The collection of subspaces is further processed by a greedy selection procedure or a spectral method to generate the final model. We discuss applications to tracking-based motion segmentation and clustering of faces under different illuminating conditions. We give extensive experimental evidence demonstrating the state of the art accuracy and speed of the suggested algorithms on these problems and also on synthetic hybrid linear data as well as the MNIST handwritten digits data; and we demonstrate how to use our algorithms for fast determination of the number of affine subspaces.

algorithm, artificial intelligence, machine learning, (17 more...)

doi: 10.1007/s11263-012-0535-6

1010.346

Country:

Asia (0.46)
North America > United States (0.28)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Barbu, Andrei, Michaux, Aaron, Narayanaswamy, Siddharth, Siskind, Jeffrey Mark

Simultaneous Object Detection, Tracking, and Event Recognition

arXiv.org Artificial IntelligenceApr-12-2012

The common internal structure and algorithmic organization of object detection, detection-based tracking, and event recognition facilitates a general approach to integrating these three components. This supports multidirectional information flow between these components allowing object detection to influence tracking and event recognition and event recognition to influence tracking and object detection. The performance of the combination can exceed the performance of the components in isolation. This can be done with linear asymptotic complexity.

artificial intelligence, detection, image understanding, (17 more...)

1204.2741

Country:

North America > United States > California (0.14)
North America > United States > Indiana > Tippecanoe County (0.14)

Industry: Government > Military (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision > Image Understanding (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.68)