
 Image Processing


Kernelized Weighted SUSAN based Fuzzy C-Means Clustering for Noisy Image Segmentation

arXiv.org Machine Learning

The paper proposes a novel kernelized image segmentation scheme for noisy images that utilizes the concept of the Smallest Univalue Segment Assimilating Nucleus (SUSAN) and incorporates spatial constraints by computing circular colour-map-induced weights. Fuzzy damping coefficients are obtained for each nucleus (center pixel) on the basis of the corresponding weighted SUSAN area values, the weights being equal to the inverse of the number of horizontal and vertical moves required to reach a neighborhood pixel from the center pixel. These weights are used to vary the contributions of the different nuclei in the kernel-based framework. The paper also presents an edge quality metric obtained by fuzzy-decision-based edge candidate selection followed by computation of the blurriness of the selected edges. The inability of existing algorithms to preserve edge information and structural details in their segmented maps necessitates the computation of the edge quality factor (EQF) for all competing algorithms. Qualitative and quantitative analyses are carried out against state-of-the-art algorithms on images corrupted by various types of noise. Speckle-corrupted SAR images and Rician-noise-corrupted magnetic resonance images are also considered to evaluate the effectiveness of the proposed algorithm in extracting important segmentation information.
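
As a rough illustration of the weighting idea described in the abstract (not the authors' implementation), the sketch below computes inverse-city-block-distance weights for a square neighborhood and a weighted SUSAN-style area using the classical smooth brightness-similarity comparison; the window radius and brightness threshold are arbitrary assumptions, not values from the paper.

```python
import numpy as np

def cityblock_weights(radius):
    """Weights for a (2r+1)x(2r+1) neighborhood: inverse of the number of
    horizontal + vertical moves needed to reach each pixel from the center.
    The center itself gets weight 0 (it is the nucleus, not a neighbor)."""
    size = 2 * radius + 1
    w = np.zeros((size, size))
    for i in range(size):
        for j in range(size):
            d = abs(i - radius) + abs(j - radius)  # city-block distance
            if d > 0:
                w[i, j] = 1.0 / d
    return w

def weighted_susan_area(image, radius=2, t=27.0):
    """Illustrative weighted SUSAN area: for each nucleus, sum the weights of
    neighbors whose brightness is close to that of the nucleus, using the
    classical smooth SUSAN comparison exp(-((I - I0)/t)**6).
    radius and t are arbitrary choices here, not the paper's settings."""
    w = cityblock_weights(radius)
    pad = np.pad(image.astype(float), radius, mode='reflect')
    area = np.zeros(image.shape, dtype=float)
    rows, cols = image.shape
    for r in range(rows):
        for c in range(cols):
            patch = pad[r:r + 2 * radius + 1, c:c + 2 * radius + 1]
            similar = np.exp(-((patch - float(image[r, c])) / t) ** 6)
            area[r, c] = np.sum(w * similar)
    return area
```

In the paper, such weighted area values are further mapped to fuzzy damping coefficients that modulate each nucleus's contribution in the kernelized clustering; that step is not shown here.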


Accord.NET Machine Learning Framework

#artificialintelligence

The Accord.NET Framework is a .NET machine learning framework combined with audio and image processing libraries, completely written in C#. It is a complete framework for building production-grade computer vision, computer audition, signal processing, and statistics applications, even for commercial use. A comprehensive set of sample applications provides a fast start, and extensive documentation and a wiki help fill in the details.


Radiology to gain from artificial intelligence in healthcare

#artificialintelligence

Volume of studies: The number of captured and stored medical images is increasing. As the quantity of images has gone up, so too has the amount of time it takes radiologists to review the data. Artificial intelligence in healthcare can take some of this work away from radiologists by processing images and scanning medical studies to quickly detect patterns or abnormalities that could be missed by the naked eye. The artificial intelligence system could then pass the results of its review to a radiologist for confirmation. Reporting and classification: Natural language processing is another technology radiologists can use to assist them with documentation and reporting.


Are there any efficient (the forward speed is much faster than AlexNet) models that attain at least the same performance as AlexNet for image classification? • /r/MachineLearning

@machinelearnbot

Look up model compression: there were discussions on this subreddit where people much more competent than me suggested literature for that. The first paper that comes to mind: http://arxiv.org/abs/1504.04788. Also check out SqueezeNet, although its focus is more on the number of parameters and deployability than on inference speed: http://arxiv.org/abs/1602.07360
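
For a quick sanity check of forward speed (not part of the original thread), the sketch below times a single-image forward pass of AlexNet against SqueezeNet 1.1 via torchvision, assuming PyTorch and torchvision are installed. Weights are left random since only inference time is measured, and, as noted above, SqueezeNet's main advantage is parameter count rather than raw speed.

```python
import time
import torch
import torchvision.models as models

def time_forward(model, runs=20):
    """Rough CPU timing of a single-image forward pass, in seconds per run."""
    model.eval()
    x = torch.randn(1, 3, 224, 224)
    with torch.no_grad():
        model(x)  # warm-up pass
        start = time.time()
        for _ in range(runs):
            model(x)
    return (time.time() - start) / runs

alexnet = models.alexnet()          # random weights; timing only
squeezenet = models.squeezenet1_1()
print("AlexNet   : %.1f ms" % (1000 * time_forward(alexnet)))
print("SqueezeNet: %.1f ms" % (1000 * time_forward(squeezenet)))
```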


Neuromorphic Chips: Using Animal Brains as a Model for Computing

#artificialintelligence

Strong interest in artificial intelligence and machine learning is driving rapid advances in how the basic elements of computers are architected. GPUs are one example: a GPU consists of a large number of processor cores that can all work in parallel and are tuned to perform very well on specific kinds of problems, like image processing. While originally developed primarily for graphics processing, GPUs are increasingly being used for other computationally intensive problems in machine learning. Our current concept of how a computer works was first conceived by Turing and von Neumann in the 1940s. In the von Neumann model of computing, there is a central processing unit, or CPU, that uses internal registers for processing data.


IBM's Automated Radiologist Can Read Images and Medical Records

#artificialintelligence

Most smart software in use today specializes in one type of data, be that interpreting text or guessing at the content of photos. Software in development at IBM aims to do all of that at once. It's in training to become a radiologist's assistant. The software is code-named Avicenna, after an 11th-century philosopher who wrote an influential medical encyclopedia. It can identify anatomical features and abnormalities in medical images such as CT scans, and it also draws on text and other data in a patient's medical record to suggest possible diagnoses and treatments.


Discriminative models for robust image classification

arXiv.org Machine Learning

A variety of real-world tasks involve the classification of images into pre-determined categories. Designing image classification algorithms that exhibit robustness to acquisition noise and image distortions, particularly when the available training data are insufficient to learn accurate models, is a significant challenge. This dissertation explores the development of discriminative models for robust image classification that exploit underlying signal structure, via probabilistic graphical models and sparse signal representations. Probabilistic graphical models are widely used in many applications to approximate high-dimensional data in a reduced-complexity setup. Learning graphical structures to approximate probability distributions is an area of active research. Recent work has focused on learning graphs in a discriminative manner with the goal of minimizing classification error. In the first part of the dissertation, we develop a discriminative learning framework that exploits the complementary yet correlated information offered by multiple representations (or projections) of a given signal/image. Specifically, we propose a discriminative tree-based scheme for feature fusion by explicitly learning the conditional correlations among such multiple projections in an iterative manner. Experiments reveal the robustness of the resulting graphical model classifier to insufficient training data.
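
The discriminative tree-learning scheme itself is not spelled out in this summary. As a loose illustration of the tree-structured graphical models it builds on, the sketch below fits a standard generative Chow-Liu tree (maximum-weight spanning tree on pairwise mutual information estimated from histograms); this is a related classical technique, not the dissertation's discriminative method.

```python
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree

def chow_liu_tree(X, bins=8):
    """Generative Chow-Liu tree over the columns of X (n_samples x n_features):
    maximum-weight spanning tree on pairwise mutual information, estimated
    from 2-D histograms of discretized features. Returns a list of edges."""
    n, d = X.shape
    # Discretize each feature into equal-width bins for MI estimation.
    Xq = np.stack([np.digitize(X[:, j], np.histogram_bin_edges(X[:, j], bins)[1:-1])
                   for j in range(d)], axis=1)
    mi = np.zeros((d, d))
    for i in range(d):
        for j in range(i + 1, d):
            joint, _, _ = np.histogram2d(Xq[:, i], Xq[:, j], bins=bins)
            p = joint / n
            pi, pj = p.sum(1, keepdims=True), p.sum(0, keepdims=True)
            nz = p > 0
            mi[i, j] = mi[j, i] = np.sum(p[nz] * np.log(p[nz] / (pi @ pj)[nz]))
    # Maximum-weight spanning tree = minimum spanning tree on negated weights.
    tree = minimum_spanning_tree(-mi)
    return list(zip(*tree.nonzero()))
```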


Bayesian nonparametric image segmentation using a generalized Swendsen-Wang algorithm

arXiv.org Machine Learning

Unsupervised image segmentation aims at clustering the set of pixels of an image into spatially homogeneous regions. We introduce here a class of Bayesian nonparametric models to address this problem. These models are based on a combination of a Potts-like spatial smoothness component and a prior on partitions which is used to control both the number and size of clusters. This class of models is flexible enough to include the standard Potts model and the more recent Potts-Dirichlet Process model [Orbanz2008]. More importantly, any prior on partitions can be introduced to control the global clustering structure, so that it is possible to penalize small or large clusters if necessary. Bayesian computation is carried out using an original generalized Swendsen-Wang algorithm. Experiments demonstrate that our method is competitive in terms of Rand index compared to popular image segmentation methods, such as mean-shift, and recent alternative Bayesian nonparametric models.
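
As a small illustration of the Potts-like smoothness component mentioned above (the partition prior and the generalized Swendsen-Wang sampler are beyond a short sketch), the following counts label agreements between neighboring pixels of a segmentation map; smoother labelings receive lower, more favorable energy.

```python
import numpy as np

def potts_energy(labels, beta=1.0):
    """Potts-style smoothness term on a 2-D label image: -beta times the number
    of horizontally/vertically adjacent pixel pairs that share the same label.
    Only the spatial-smoothness component of the models discussed is shown."""
    same_h = labels[:, :-1] == labels[:, 1:]
    same_v = labels[:-1, :] == labels[1:, :]
    return -beta * (np.sum(same_h) + np.sum(same_v))

# A flat labeling scores lower (better) than a noisy one:
flat = np.zeros((8, 8), dtype=int)
noisy = np.random.randint(0, 4, size=(8, 8))
print(potts_energy(flat), potts_energy(noisy))
```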


Iterative Gaussianization: from ICA to Random Rotations

arXiv.org Machine Learning

Most signal processing problems involve the challenging task of multidimensional probability density function (PDF) estimation. In this work, we propose a solution to this problem by using a family of Rotation-based Iterative Gaussianization (RBIG) transforms. The general framework consists of the sequential application of a univariate marginal Gaussianization transform followed by an orthonormal transform. The proposed procedure looks for differentiable transforms to a known PDF so that the unknown PDF can be estimated at any point of the original domain. In particular, we aim at a zero-mean, unit-covariance Gaussian for convenience. RBIG is formally similar to classical iterative Projection Pursuit (PP) algorithms. However, we show that, unlike in PP methods, the particular class of rotations used has no special qualitative relevance in this context, since looking for interestingness is not a critical issue for PDF estimation. The key difference is that our approach focuses on the univariate part (marginal Gaussianization) of the problem rather than on the multivariate part (rotation). This difference implies that one may select the most convenient rotation suited to each practical application. The differentiability, invertibility and convergence of RBIG are theoretically and experimentally analyzed. Relation to other methods, such as Radial Gaussianization (RG), one-class support vector domain description (SVDD), and deep neural networks (DNN), is also pointed out. The practical performance of RBIG is successfully illustrated in a number of multidimensional problems such as image synthesis, classification, denoising, and multi-information estimation.
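
A minimal sketch of the RBIG recipe described above, assuming random orthonormal rotations (the paper argues the choice of rotation is not critical) and omitting the Jacobian bookkeeping that actual PDF estimation requires:

```python
import numpy as np
from scipy.stats import norm, rankdata

def marginal_gaussianization(X):
    """Map each column of X to approximately N(0, 1) via its empirical ranks."""
    n = X.shape[0]
    U = np.apply_along_axis(rankdata, 0, X) / (n + 1)  # uniform marginals in (0, 1)
    return norm.ppf(U)

def rbig(X, n_iters=20, seed=0):
    """Alternate marginal Gaussianization with a random orthonormal rotation.
    After enough iterations the transformed samples look jointly Gaussian."""
    rng = np.random.default_rng(seed)
    Y = np.asarray(X, dtype=float)
    for _ in range(n_iters):
        Y = marginal_gaussianization(Y)
        # Random orthonormal matrix via QR decomposition of a Gaussian matrix.
        Q, _ = np.linalg.qr(rng.standard_normal((Y.shape[1], Y.shape[1])))
        Y = Y @ Q
    return Y
```

In the paper, each marginal map and rotation is kept invertible and differentiable, which is what lets the unknown PDF be evaluated back in the original domain; this sketch only shows the forward chain of transforms.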


Nonlinearities and Adaptation of Color Vision from Sequential Principal Curves Analysis

arXiv.org Machine Learning

Mechanisms of human color vision are characterized by two phenomenological aspects: the system is nonlinear and adaptive to changing environments. Conventional attempts to derive these features from statistics use separate arguments for each aspect. The few statistical approaches that do consider both phenomena simultaneously follow parametric formulations based on empirical models. Therefore, it may be argued that the behavior does not come directly from the color statistics but from the convenient functional form adopted. In addition, in many cases the whole statistical analysis is based on simplified databases that disregard relevant physical effects in the input signal, for instance by assuming flat Lambertian surfaces. Here we address the simultaneous statistical explanation of (i) the nonlinear behavior of achromatic and chromatic mechanisms in a fixed adaptation state, and (ii) the change of such behavior. Both phenomena emerge directly from the samples through a single data-driven method: the Sequential Principal Curves Analysis (SPCA) with local metric. SPCA is a new manifold learning technique to derive a set of sensors adapted to the manifold using different optimality criteria. A new database of colorimetrically calibrated images of natural objects under different illuminants was collected. The results obtained by applying SPCA show that the psychophysical behavior on color discrimination thresholds, discount of the illuminant, and corresponding pairs in asymmetric color matching emerges directly from realistic data regularities, assuming no a priori functional form. These results provide stronger evidence for the hypothesis of a statistically driven organization of color sensors. Moreover, the obtained results suggest that color perception at this low abstraction level may be guided by an error minimization strategy rather than by the information maximization principle.