AITopics | Deep Learning

Data preprocessing for deep learning with nuts-ml

#artificialintelligenceJun-3-2017, 20:25:23 GMT

Data preprocessing is a fundamental part of any machine learning project and often more time is spent on the data preparation than on the actual machine learning. While some preprocessing tasks are problem specific many others such as partitioning data into training and test folds, stratifying samples or building mini-batches are generic. The following Canonical Pipeline shows the processing steps common for deep-learning in vision. A Reader reads sample data stored in text files, Excel or Pandas tables. The Splitter then partitions data into training, validation and test folds and performs stratification if needed. Usually not all image data can be loaded into memory and a Loader loads images on demand. These images are often processed by a Transformer, for resizing, cropping or other adjustments. Furthermore, to increase the training set additional images are synthesized by randomly augmenting (flipping, rotating, …) images using an Augmenter. Efficient, GPU-based machine learning demands that image and label data are grouped in mini-batches via a Batcher before passed on to the Network for training or inference. Finally, to keep track of the training progress, usually a Logger is employed to write training losses or accuracies to a log file. Some machine learning frameworks such as Keras provide (some of) these preprocessing components hidden behind an API that considerably simplify network training if it fits the task at hand. See the following excerpt of a Keras example to train a model with augmentation. However, what if an image format, augmentation or other preprocessing capability is needed that is not provided by the API? Extending a library such as Keras or others is not trivial and the common approach is to (re)implement the required functionality – often in a quick-and-dirty fashion. But implementing a robust data pipeline that loads, transforms, augments and processes images on demand is challenging and time consuming. The following excerpt from a nuts-ml example shows a pipeline for network training, where the operator defines the flow of data. In the example above training images are augmented, pixel values re-ranged, and the samples shuffled before building batches for network training. Finally, the mean over the batch-wise training losses is computed and printed. The resulting code is more readable and can readily be modified to experiment with different preprocessing schemes. Task-specific functions can easily be implement as nuts and added to the data flow. Any machine learning library that accepts mini-batches of Numpy arrays for training or inference is compatible. For more information about nuts-ml see the Introduction and have a look at the Tutorial. Bio: Stefan Maetschke (PhD) is a research scientist at IBM Research Australia where he develops machine learning infrastructure and models for medical image analysis.

deep learning, neural network, training, (18 more...)

#artificialintelligence

Genre: Press Release (0.57)

Industry: Information Technology > Software (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Add feedback

Data preprocessing for deep learning with nuts-ml

#artificialintelligenceJun-3-2017, 20:25:23 GMT

Data preprocessing is a fundamental part of any machine learning project and often more time is spent on the data preparation than on the actual machine learning. While some preprocessing tasks are problem specific many others such as partitioning data into training and test folds, stratifying samples or building mini-batches are generic. The following Canonical Pipeline shows the processing steps common for deep-learning in vision. A Reader reads sample data stored in text files, Excel or Pandas tables. The Splitter then partitions data into training, validation and test folds and performs stratification if needed. Usually not all image data can be loaded into memory and a Loader loads images on demand. These images are often processed by a Transformer, for resizing, cropping or other adjustments. Furthermore, to increase the training set additional images are synthesized by randomly augmenting (flipping, rotating, …) images using an Augmenter. Efficient, GPU-based machine learning demands that image and label data are grouped in mini-batches via a Batcher before passed on to the Network for training or inference. Finally, to keep track of the training progress, usually a Logger is employed to write training losses or accuracies to a log file. Some machine learning frameworks such as Keras provide (some of) these preprocessing components hidden behind an API that considerably simplify network training if it fits the task at hand. See the following excerpt of a Keras example to train a model with augmentation. However, what if an image format, augmentation or other preprocessing capability is needed that is not provided by the API? Extending a library such as Keras or others is not trivial and the common approach is to (re)implement the required functionality – often in a quick-and-dirty fashion. But implementing a robust data pipeline that loads, transforms, augments and processes images on demand is challenging and time consuming. The following excerpt from a nuts-ml example shows a pipeline for network training, where the operator defines the flow of data. In the example above training images are augmented, pixel values re-ranged, and the samples shuffled before building batches for network training. Finally, the mean over the batch-wise training losses is computed and printed. The resulting code is more readable and can readily be modified to experiment with different preprocessing schemes. Task-specific functions can easily be implement as nuts and added to the data flow. Any machine learning library that accepts mini-batches of Numpy arrays for training or inference is compatible. For more information about nuts-ml see the Introduction and have a look at the Tutorial. Bio: Stefan Maetschke (PhD) is a research scientist at IBM Research Australia where he develops machine learning infrastructure and models for medical image analysis.

deep learning, neural network, training, (18 more...)

#artificialintelligence

Genre: Press Release (0.57)

Industry: Information Technology > Software (0.74)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.74)

Add feedback

How AI and Deep Learning Help Explain Human Fear - iQ by Intel

#artificialintelligenceJun-3-2017, 20:25:20 GMT

Researchers are breaking down the barrier between people and machines by teaching computers to recognize fear. On the 4th floor of the pristine Media Lab Complex at MIT lives a Nightmare Machine. These computers earned that nickname for a reason: they have been learning how to terrify people. A series of algorithms generates disturbing and grotesque images, like movie monsters, dead people, and other things that go bump in the night. "We wanted to playfully explore how artificial intelligence (AI) can become a demon that learns how to scare you," said Pinar Yanardag Delul, one of the creators of the gore-loving computer program.

artificial intelligence, computer, machine learning, (12 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.36)

Industry:

Transportation > Passenger (0.75)
Transportation > Ground > Road (0.51)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.31)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.51)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.41)

Add feedback

When AI Tech Turned a Corner -- Google's Deep Mind AlphaGo Beating Lee Sedol

#artificialintelligenceJun-3-2017, 20:25:13 GMT

I meant to write this blog several months ago. However as a compulsive procrastinator I kept putting it to another day, until today. On January 27, an article in Nature reported on a computer that had beat a human player at Go. It is an ancient board game that has long been viewed as a hard nut to crack for Artificial Intelligence (AI). Till then, computers had already beaten the best human players of backgammon, draughts, and chess.

artificial intelligence, computer, machine learning, (15 more...)

#artificialintelligence

Country: North America > United States > California (0.06)

Industry: Leisure & Entertainment > Games > Go (0.66)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)
Information Technology > Artificial Intelligence > Games > Go (0.53)

Add feedback

Scientists slash computations for deep learning

#artificialintelligenceJun-3-2017, 18:45:32 GMT

Rice University computer scientists have adapted a widely used technique for rapid data lookup to slash the amount of computation--and thus energy and time--required for deep learning, a computationally intense form of machine learning. "This applies to any deep-learning architecture, and the technique scales sublinearly, which means that the larger the deep neural network to which this is applied, the more the savings in computations there will be," said lead researcher Anshumali Shrivastava, an assistant professor of computer science at Rice. The research will be presented in August at the KDD 2017 conference in Halifax, Nova Scotia. It addresses one of the biggest issues facing tech giants like Google, Facebook and Microsoft as they race to build, train and deploy massive deep-learning networks for a growing body of products as diverse as self-driving cars, language translators and intelligent replies to emails. Shrivastava and Rice graduate student Ryan Spring have shown that techniques from "hashing," a tried-and-true data-indexing method, can be adapted to dramatically reduce the computational overhead for deep learning.

artificial intelligence, machine learning, neuron, (12 more...)

#artificialintelligence

Country: North America > Canada > Nova Scotia > Halifax Regional Municipality > Halifax (0.25)

Genre: Research Report (0.36)

Industry:

Information Technology (0.56)
Education (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

When Does Deep Learning Work Better Than SVMs or Random Forests?

@machinelearnbotJun-3-2017, 15:25:13 GMT

If we tackle a supervised learning problem, my advice is to start with the simplest hypothesis space first. I.e., try a linear model such as logistic regression. If this doesn't work "well" (i.e., it doesn't meet our expectation or performance criterion that we defined earlier), I would move on to the next experiment. I would say that random forests are probably THE "worry-free" approach - if such a thing exists in ML: There are no real hyperparameters to tune (maybe except for the number of trees; typically, the more trees we have the better). On the contrary, there are a lot of knobs to be turned in SVMs: Choosing the "right" kernel, regularization penalties, the slack variable, ... Both random forests and SVMs are non-parametric models (i.e., the complexity grows as the number of training samples increases).

artificial intelligence, machine learning, svm, (13 more...)

@machinelearnbot

Country: North America > United States > Michigan (0.05)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)

Add feedback

Microsoft's upgraded deep learning tools released to the world

#artificialintelligenceJun-3-2017, 13:40:07 GMT

The Microsoft Cognitive Toolkit was originally launched in beta back in October 2016. It provides a set of robust intelligent services that enable developers to draw on AI within their own products. The revised version launched this week includes "hundreds" of new features that combine to make a simpler and more powerful toolkit. Microsoft originally built the Cognitive Toolkit to aid the development of its own internal services. It was devised as a method of accelerating the previously costly training procedures required to launch new neural networks.

artificial intelligence, machine learning, microsoft, (7 more...)

#artificialintelligence

Country: North America > United States > Maryland > Anne Arundel County > Annapolis (0.06)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Add feedback

Understanding the Basics of Deep Learning and Neural Networks

#artificialintelligenceJun-3-2017, 11:55:18 GMT

Last week I had the opportunity to visit my graduate school alma mater, The University of Arizona where I studied artificial intelligence and image processing many years ago. I remember signing up for my first semester classes and electing to challenge myself with Professor Neifeld's neural network class. It already had the reputation of being one of the toughest classes requiring students to understand both the mathematical theory and real-world application of neural networks to solve classification and other problems. Neural Networks Before Cloud Computing Of course back then there wasn't cloud computing or easy access to parallel computing methods or deep learning Python libraries. As students, we had to program the algorithms by hand starting with the mathematics of a single neuron, the iterations to loop through all the neurons in each layer, and the algorithms to implement the backpropagation learning algorithms.

algorithm, artificial intelligence, machine learning, (14 more...)

#artificialintelligence

Country: North America > United States > Arizona (0.25)

Industry: Education (0.56)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.65)

Add feedback

Demystifying AI, ML, DL with Vishal Sikka and real world examples

#artificialintelligenceJun-3-2017, 11:55:09 GMT

The technology industry is plagued with buzzword bingo in support of the fashion driven nature of the technology beast. Often confusing and occasionally downright ridiculous, we're never going to prevent smart ass marketers, ably supported by their anal-yst surrogates from making stuff up. The least some of us can do is make clear what is under discussion without mindlessly parroting what others say or conflating one concept with another. The latest in this stream of marketing laden garbage is AI or Artificial Intelligence, smeared with ML or Machine Learning and DL or Deep Learning. Add a soupçon of'robotics' just to amp the volume to something people can'get' and you have the potential for an exotic mix that both captivates the sentient mind but can also plant fear.

artificial intelligence, deep learning, machine learning, (16 more...)

#artificialintelligence

Country: