Deep Learning
Optimal deep neural networks for sparse recovery via Laplace techniques
Limmer, Steffen, Stanczak, Slawomir
This paper introduces Laplace techniques for designing a neural network, with the goal of estimating simplex-constraint sparse vectors from compressed measurements. To this end, we recast the problem of MMSE estimation (w.r.t. a pre-defined uniform input distribution) as the problem of computing the centroid of some polytope that results from the intersection of the simplex and an affine subspace determined by the measurements. Owing to the specific structure, it is shown that the centroid can be computed analytically by extending a recent result that facilitates the volume computation of polytopes via Laplace transformations. A main insight of this paper is that the desired volume and centroid computations can be performed by a classical deep neural network comprising threshold functions, rectified linear (ReLU) and rectified polynomial (ReP) activation functions. The proposed construction of a deep neural network for sparse recovery is completely analytic so that time-consuming training procedures are not necessary. Furthermore, we show that the number of layers in our construction is equal to the number of measurements which might enable novel low-latency sparse recovery algorithms for a larger class of signals than that assumed in this paper. To assess the applicability of the proposed uniform input distribution, we showcase the recovery performance on samples that are soft-classification vectors generated by two standard datasets. As both volume and centroid computation are known to be computationally hard, the network width grows exponentially in the worst-case. It can be, however, decreased by inducing sparse connectivity in the neural network via a well-suited basis of the affine subspace. Finally, the presented analytical construction may serve as a viable initialization to be further optimized and trained using particular input datasets at hand.
Building AI Superclusters in Canada โ Synced โ Medium
The breakthrough came in 2006: Hinton led a published paper called A Fast Learning Algorithm for Deep Belief Nets, which first proposed the method of greedy layer-wise training for deep neural networks. In an competition run by ImageNet in 2012, Hinton's UofT team used convolutional neural networks (CNN) for image recognition application. Given the large pool of image datasets and the computation power of GPU processors, the team was on the right track, and their results redefined the field of computer vision. Two of Hinton's earliest correspondents were Yoshua Bengio from the University of Montreal and his own postdoc student Yan Lecun, who joined Hinton's UofT lab in 1987 and now leads AI research at Facebook. The accomplished trio is sometimes jokingly referred to as the "Canadian Mafia" of deep learning.
Build your 1st Deep Learning Rig โ Prolego blog
In part 2 I explained why you need an AI Sandbox. In this post I'll explain how to build your deep learning rig, the hardware portion of your AI Sandbox. When I started my career hardware was a big part of working in technology. Buying, hosting, installing, and configuring hardware was just part of the job. Times have changed and today almost all development happens in the cloud.
Announcing tools for the AI-driven digital transformation
Artificial Intelligence (AI) has emerged as one of the most disruptive forces behind the digital transformation of business. Today, at Microsoft Ignite 2017, as we engage in conversations about digital transformation with over 25,000 customers and partners, I am pleased to share some of our latest and most exciting innovations in AI development platforms. These announcements โ which span Azure Machine Learning (AML), new Visual Studio tools for AI, Cognitive Services and new enterprise AI solutions โ demonstrate our mission to bring AI to every developer and every organization on the planet, and to help businesses augment human ingenuity in unique and differentiated ways. Today we are announcing a set of powerful new capabilities in AML for developers to exploit big data, GPUs, data wrangling and container based model deployment. Let me tell you more about these below and for a deep dive please review this AML blog. The AML Workbench is a cross-platform client application that runs on Windows and Mac and serves as a control panel for your development lifecycle.
Microsoft launches new machine learning tools
Microsoft, just like many of its competitors, has gone all in on machine learning. That emphasis is on full display at the company's Ignite conference this where, where the company today announced a number of new tools for developers who want to build new A.I. models and users who simply want to make use of these pre-existing models -- either from their own teams or from Microsoft. For developers, the company launched three major new tools today: the Azure Machine Learning Experimentation service, the Azure Machine Learning Workbench and the Azure Machine Learning Model Management service. In addition, Microsoft also launched a new set of tools for developers who want to use its Visual Studio Code IDE for building models with CNTK, TensorFlow, Theano, Keras and Caffe2. And for non-developers, Microsoft is also bringing Azure-based machine learning models to Excel users, who will now be able to call up the AI functions that their company's data scientists have created right from their spreadsheets.
Decoding Apple iPhone X's FaceID & How Face Recognition Tech Will Soon Make Passwords Extinct
After Apple unveiled the much awaited iPhone X, the entire world has been awestruck with the device's ability to unlock a smartphone by simply looking at it -- through FaceID. And while many might feel that it's quite revolutionary, the principles of face recognition technology were already in place for quite some time now. Amidst privacy and security concerns, let's see how it works. Facial Recognition is the ability of an application or device to detect and recognise a specific face, which can be used for numerous applications. Basically, facial recognition technology allows users to gain access by simply scanning the face.
Two New Courses are Now Available for Machine Learning and Deep Learning on AWS Amazon Web Services
AWS Training and Certification helps you advance your knowledge with practical skills so you can get more out of the AWS Cloud. We now have two new courses to help you learn about how to leverage artificial intelligence (AI) solutions using AWS: Introduction to Machine Learning web-based training and Deep Learning on AWS instructor-led training. If you are looking to learn more about how you can put AI capabilities to use, we recommend that you start with the web-based training. Developers looking to learn more should then attend the one-day instructor-led training. Introduction to Machine Learning is a free 40 minute web-based training intended for developers, solutions architects, and IT decision makers who already know the foundations of working with AWS.
A Deep Learning Tutorial: From Perceptrons to Deep Networks
In this tutorial, I'll introduce you to the key concepts and algorithms behind deep learning, beginning with the simplest unit of composition and building to the concepts of machine learning in Java. The single perceptron approach to deep learning has one major drawback: it can only learn linearly separable functions. By the universal approximation theorem, a single hidden layer network with a finite number of neurons can be trained to approximate an arbitrarily random function. You can see a simple (4-2-3 layer) feedforward neural network that classifies the IRIS dataset implemented in Java here through the testMLPSigmoidBP method.
Predicting Portland Home Prices
For my final project at Metis, I wanted to choose something that enabled me to incorporate all that I had learned during the past three months. Predicting Portland home prices allowed me to do this because I was able to incorporate various web scraping techniques, natural language processing on text, deep learning models on images, and gradient boosting into tackling the problem. Below you can see 8,300 single family home sales that I scraped in Portland, OR between July 2016 - July 2017. Obviously, neighborhood plays a large role. The West Hills (in red) are one of the priciest areas in town, whereas East Portland is cheaper.
Generative learning for deep networks
Flach, Boris, Shekhovtsov, Alexander, Fikar, Ondrej
Learning, taking into account full distribution of the data, referred to as generative, is not feasible with deep neural networks (DNNs) because they model only the conditional distribution of the outputs given the inputs. Current solutions are either based on joint probability models facing difficult estimation problems or learn two separate networks, mapping inputs to outputs (recognition) and vice-versa (generation). We propose an intermediate approach. First, we show that forward computation in DNNs with logistic sigmoid activations corresponds to a simplified approximate Bayesian inference in a directed probabilistic multi-layer model. This connection allows to interpret DNN as a probabilistic model of the output and all hidden units given the input. Second, we propose that in order for the recognition and generation networks to be more consistent with the joint model of the data, weights of the recognition and generator network should be related by transposition. We demonstrate in a tentative experiment that such a coupled pair can be learned generatively, modelling the full distribution of the data, and has enough capacity to perform well in both recognition and generation.