Vision


Experimental Standards in Research on AI and Humor When Considering Psychology

AAAI Conferences

Based on recent experiments involving a laughing virtual agent and a human user at the intersection of AI, humor, and laughter, this paper highlights some of the psychological considerations that arise when conducting AI and humor experiments. The systematic and standardized approach outlined in this paper demonstrates how to reduce error variance caused by confounding variables such as poor experimental controls. Solutions offered toward this goal range from the necessity of cover stories, protocols, and procedures to the pros and cons of subjective and objective measurement and what is required for both to give valid and reliable results. Furthermore, the psychological individual differences that need consideration are discussed, such as the appreciation of different types of humor, mood, personality variables (for example, trait and state cheerfulness), and gelotophobia, the fear of being laughed at.


Efficient Point-to-Subspace Query in $\ell^1$: Theory and Applications in Computer Vision

arXiv.org Machine Learning

Motivated by vision tasks such as robust face and object recognition, we consider the following general problem: given a collection of low-dimensional linear subspaces in a high-dimensional ambient (image) space and a query point (image), efficiently determine the nearest subspace to the query in $\ell^1$ distance. We show in theory that Cauchy random embedding of the objects into significantly lower-dimensional spaces helps preserve the identity of the nearest subspace with constant probability. This offers the possibility of efficiently selecting several candidates for accurate search. We sketch preliminary experiments on robust face and digit recognition to corroborate our theory.
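
To make the pipeline concrete, here is a minimal sketch (Python, assuming numpy and scipy; all names, dimensions, and the 1/m scaling of the embedding are illustrative assumptions, not the paper's exact construction). It embeds the query and the subspace bases with an i.i.d. standard Cauchy matrix and ranks candidates by the embedded $\ell^1$ point-to-subspace distance, computed as a small linear program:

import numpy as np
from scipy.optimize import linprog

def l1_point_to_subspace(q, B):
    # l1 distance from q (d,) to the column span of B (d, k), via the LP
    #   min_{c, t} sum(t)  subject to  -t <= q - B c <= t
    d, k = B.shape
    obj = np.concatenate([np.zeros(k), np.ones(d)])        # variables [c, t]
    A_ub = np.block([[B, -np.eye(d)], [-B, -np.eye(d)]])   # B c - t <= q ; -B c - t <= -q
    b_ub = np.concatenate([q, -q])
    res = linprog(obj, A_ub=A_ub, b_ub=b_ub, bounds=[(None, None)] * (k + d))
    return res.fun

rng = np.random.default_rng(0)
d, k, m = 1024, 5, 64                                      # ambient, subspace, embedded dims
bases = [rng.normal(size=(d, k)) for _ in range(10)]
q = bases[3] @ rng.normal(size=k) + 0.01 * rng.standard_normal(d)  # point near subspace 3

R = rng.standard_cauchy((m, d)) / m                        # i.i.d. Cauchy embedding
dists = [l1_point_to_subspace(R @ q, R @ B) for B in bases]
print(int(np.argmin(dists)))                               # nearest candidate, likely 3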


Full Object Boundary Detection by Applying Scale Invariant Features in a Region Merging Segmentation Algorithm

arXiv.org Artificial Intelligence

Object detection is a fundamental task in computer vision and has many applications in image processing. This paper proposes a new approach to object detection that applies the scale-invariant feature transform (SIFT) within an automatic segmentation algorithm. SIFT is invariant with respect to scale, translation, and rotation. Its features are highly distinctive and provide stable keypoints that can be used to match an object across different images. First, an object is trained from different aspects to find the best keypoints. The object can then be recognized in other images using the obtained keypoints. Next, a robust segmentation algorithm detects the object with its full boundary based on the SIFT keypoints. Within the segmentation algorithm, a merging rule is defined to merge regions in the image with the assistance of the keypoints. The results show that the proposed approach is reliable for object detection and can extract object boundaries well.
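
A hedged sketch of the keypoint stage only (Python with OpenCV; the function names are hypothetical, and the paper's region-merging rule, its actual contribution, is not reproduced here): collect descriptors from several views of the object, then match a new image with Lowe's ratio test to obtain keypoint locations that could seed region merging:

import cv2
import numpy as np

def train_keypoints(train_images):
    # collect SIFT descriptors from several views (aspects) of the object
    sift = cv2.SIFT_create()
    all_des = []
    for img in train_images:
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
        _, des = sift.detectAndCompute(gray, None)
        if des is not None:
            all_des.append(des)
    return np.vstack(all_des)

def object_keypoints(query_img, train_des, ratio=0.75):
    # match query keypoints against the trained set with Lowe's ratio test;
    # the surviving (x, y) locations would seed the region-merging step
    sift = cv2.SIFT_create()
    gray = cv2.cvtColor(query_img, cv2.COLOR_BGR2GRAY)
    kp, des = sift.detectAndCompute(gray, None)
    matches = cv2.BFMatcher(cv2.NORM_L2).knnMatch(des, train_des, k=2)
    good = [m for m, n in matches if m.distance < ratio * n.distance]
    return [kp[m.queryIdx].pt for m in good]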


Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials

arXiv.org Artificial Intelligence

Most state-of-the-art techniques for multi-class image segmentation and labeling use conditional random fields defined over pixels or image regions. While region-level models often feature dense pairwise connectivity, pixel-level models are considerably larger and have only permitted sparse graph structures. In this paper, we consider fully connected CRF models defined on the complete set of pixels in an image. The resulting graphs have billions of edges, making traditional inference algorithms impractical. Our main contribution is a highly efficient approximate inference algorithm for fully connected CRF models in which the pairwise edge potentials are defined by a linear combination of Gaussian kernels. Our experiments demonstrate that dense connectivity at the pixel level substantially improves segmentation and labeling accuracy.
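
As a rough illustration (not the paper's algorithm, which also uses appearance/bilateral kernels and efficient high-dimensional filtering), the sketch below runs naive mean-field inference for a fully connected CRF whose pairwise potential is a single spatial Gaussian kernel with Potts compatibility; each message-passing step then reduces to one Gaussian blur per label map (Python, assuming numpy and scipy; parameters are illustrative):

import numpy as np
from scipy.ndimage import gaussian_filter

def mean_field_crf(unary, sigma=3.0, compat=1.0, iters=10):
    # unary: (H, W, L) negative log-probabilities over L labels
    q = np.exp(-unary)
    q /= q.sum(-1, keepdims=True)
    for _ in range(iters):
        # message passing: one Gaussian blur per label map
        blurred = np.stack([gaussian_filter(q[..., l], sigma)
                            for l in range(q.shape[-1])], axis=-1)
        msg = blurred - q                       # rough correction for the excluded self-term
        pairwise = compat * (msg.sum(-1, keepdims=True) - msg)  # Potts: sum over other labels
        q = np.exp(-unary - pairwise)
        q /= q.sum(-1, keepdims=True)
    return q.argmax(-1)                         # hard labeling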


Disentangling Factors of Variation via Generative Entangling

arXiv.org Machine Learning

Here we propose a novel model family with the objective of learning to disentangle the factors of variation in data. Our approach is based on the spike-and-slab restricted Boltzmann machine, which we generalize to include higher-order interactions among multiple latent variables. Seen from a generative perspective, the multiplicative interactions emulate the entangling of factors of variation. Inference in the model can be seen as disentangling these generative factors. Unlike previous attempts at disentangling latent factors, the proposed model is trained using no supervised information regarding the latent factors. We apply our model to the task of facial expression classification.
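
Schematically, and not in the paper's exact parameterization, a spike-and-slab energy with a higher-order multiplicative term might gate each filter response by the product of spikes from two latent groups:

E(v, s, h, g) = \frac{1}{2} v^{\top} \Lambda v - \sum_{k} (v^{\top} W_k)\, s_k h_k g_k + \frac{\alpha}{2} \sum_{k} s_k^{2} - \sum_{k} b_k h_k - \sum_{k} c_k g_k

Here v is the visible vector, the s_k are real-valued slabs, and h_k, g_k are binary spikes for two groups of latent factors; the product h_k g_k is the multiplicative "entangling" interaction, so conditioning on one group can be read as disentangling the other.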


Nested Dictionary Learning for Hierarchical Organization of Imagery and Text

arXiv.org Machine Learning

A tree-based dictionary learning model is developed for joint analysis of imagery and associated text. The dictionary learning may be applied directly to image patches, or to general feature vectors extracted from patches or superpixels (using any existing method for image feature extraction). Each image is associated with a path through the tree (from root to a leaf), and each of the multiple patches in a given image is associated with one node in that path. Nodes near the tree root are shared between multiple paths, representing image characteristics that are common among different types of images. Moving toward the leaves, nodes become specialized, representing details in image classes. If available, words (text) are also jointly modeled, with a path-dependent probability over words. The tree structure is inferred via a nested Dirichlet process, and a retrospective stick-breaking sampler is used to infer the tree depth and width.
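
The tree-structured prior can be illustrated with a nested Chinese restaurant process path sampler (a minimal Python sketch, assuming numpy; the depth is fixed here for simplicity, whereas the paper infers depth and width with a retrospective stick-breaking sampler):

import numpy as np

def ncrp_path(counts, depth, gamma, rng):
    # sample a root-to-leaf path from a nested Chinese restaurant process;
    # counts maps a path prefix (tuple) to {child index: customer count}
    path = ()
    for _ in range(depth):
        children = counts.setdefault(path, {})
        ks = sorted(children)
        n = [children[k] for k in ks]
        total = sum(n) + gamma
        probs = [x / total for x in n] + [gamma / total]  # existing tables, then a new one
        j = rng.choice(len(probs), p=probs)
        child = ks[j] if j < len(ks) else (ks[-1] + 1 if ks else 0)
        children[child] = children.get(child, 0) + 1
        path += (child,)
    return path

rng = np.random.default_rng(0)
counts = {}
paths = [ncrp_path(counts, depth=3, gamma=1.0, rng=rng) for _ in range(5)]
# paths sharing early components share dictionary atoms near the root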


Unsupervised Detection and Tracking of Arbitrary Objects with Dependent Dirichlet Process Mixtures

arXiv.org Machine Learning

This paper proposes a technique for the unsupervised detection and tracking of arbitrary objects in videos. It is intended to reduce the need for detection and localization methods tailored to specific object types and serve as a general framework applicable to videos with varied objects, backgrounds, and image qualities. The technique uses a dependent Dirichlet process mixture (DDPM) known as the Generalized Polya Urn (GPUDDPM) to model image pixel data that can be easily and efficiently extracted from the regions in a video that represent objects. This paper describes a specific implementation of the model using spatial and color pixel data extracted via frame differencing and gives two algorithms for performing inference in the model to accomplish detection and tracking. This technique is demonstrated on multiple synthetic and benchmark video datasets that illustrate its ability to, without modification, detect and track objects with diverse physical characteristics moving over non-uniform backgrounds and through occlusion.
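
A minimal sketch of the data-extraction step (Python with OpenCV and numpy; the threshold and the (x, y, r, g, b) observation format are illustrative assumptions): frame differencing marks changed pixels, whose positions and colors become the observations fed to the mixture model:

import cv2
import numpy as np

def moving_pixel_data(prev, curr, thresh=25):
    # frame differencing: return one (x, y, r, g, b) row per changed pixel
    g0 = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)
    g1 = cv2.cvtColor(curr, cv2.COLOR_BGR2GRAY)
    ys, xs = np.nonzero(cv2.absdiff(g0, g1) > thresh)
    rgb = curr[ys, xs][:, ::-1]                # OpenCV stores BGR
    return np.column_stack([xs, ys, rgb])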


AI@NICTA

AI Magazine

NICTA is Australia's Information and Communications Technology (ICT) Centre of Excellence. It is the largest organization in Australia dedicated to ICT research. While it has close links with local universities, it is in fact an independent, not-for-profit company in the business of doing research, commercializing that research, and training PhD students to do that research. Much of the work taking place at NICTA involves various topics in artificial intelligence. In this article, we survey some of the AI work being undertaken at NICTA.


Towards an Empathizing and Adaptive Storyteller System

AAAI Conferences

This paper describes our ongoing effort to build an empathizing and adaptive storyteller system. The system under development aims to deliver a story effectively by using emotional expressions generated by an avatar or a humanoid robot, together with the listener's responses, which are monitored in real time. We conducted a pilot study and analyzed the results in two ways: first, through a survey questionnaire based on the participants' subjective ratings; second, through automated video analysis of the participants' emotional facial expressions and eye blinking. The questionnaire results show that male participants tend to empathize more with a story character when a virtual storyteller is present than with audio-only narration. The video analysis results suggest that the participants' eye-blink frequency is inversely related to their attention.
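
The abstract does not specify the blink-detection pipeline; as a stand-in, a common approach is the eye-aspect-ratio heuristic computed from facial landmarks (a minimal Python sketch, assuming numpy and some landmark detector such as dlib's 68-point model; the thresholds are illustrative):

import numpy as np

def eye_aspect_ratio(eye):
    # eye: six (x, y) landmarks around one eye; the ratio of vertical to
    # horizontal extents drops sharply while the eye is closed
    eye = np.asarray(eye, dtype=float)
    v1 = np.linalg.norm(eye[1] - eye[5])
    v2 = np.linalg.norm(eye[2] - eye[4])
    h = np.linalg.norm(eye[0] - eye[3])
    return (v1 + v2) / (2.0 * h)

def count_blinks(ear_series, thresh=0.2, min_frames=2):
    # a blink = a run of at least min_frames consecutive frames below thresh
    blinks, run = 0, 0
    for ear in ear_series:
        run = run + 1 if ear < thresh else 0
        if run == min_frames:       # count each closure once
            blinks += 1
    return blinks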


Examples of Artificial Perceptions in Optical Character Recognition and Iris Recognition

arXiv.org Artificial Intelligence

This paper assumes the hypothesis that human learning is perception based, and consequently, that the learning process and perceptions should not be represented and investigated independently or modeled in different simulation spaces. To keep the analogy between artificial and human learning, the former is assumed here to be based on artificial perception. Hence, instead of choosing to apply or develop a Computational Theory of (human) Perceptions, we choose to mirror human perceptions in a numeric (computational) space as artificial perceptions and to analyze the interdependence between artificial learning and artificial perception in the same numeric space, using one of the simplest tools of Artificial Intelligence and Soft Computing, namely perceptrons. As practical applications, we work through two examples: Optical Character Recognition and Iris Recognition. In both cases a simple Turing test shows that artificial perceptions of the difference between two characters and between two irides are fuzzy, whereas the corresponding human perceptions are, in fact, crisp.
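
For concreteness, here is a minimal perceptron sketch (Python, assuming numpy; the toy 2x2 "glyphs" and labels are purely illustrative) of the kind of learner that could probe artificial perceptions of the difference between two characters:

import numpy as np

def train_perceptron(X, y, epochs=50, lr=1.0):
    # classic perceptron; y in {-1, +1}, X rows are feature vectors
    # (e.g., binarized character images or iris-code segments)
    Xb = np.hstack([X, np.ones((len(X), 1))])   # append a bias column
    w = np.zeros(Xb.shape[1])
    for _ in range(epochs):
        for xi, yi in zip(Xb, y):
            if yi * (w @ xi) <= 0:              # misclassified: nudge the boundary
                w += lr * yi * xi
    return w

X = np.array([[1, 1, 0, 0], [1, 0, 0, 0],      # toy class "A" glyphs
              [0, 0, 1, 1], [0, 0, 0, 1]])     # toy class "B" glyphs
y = np.array([1, 1, -1, -1])
w = train_perceptron(X, y)
print(np.sign(np.hstack([X, np.ones((4, 1))]) @ w))   # matches y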