neural network

A Brief History of Deep Learning (Part One) - Bulletproof


For example if we had a dataset containing past advertising budgets for various media (TV, Radio and Newspapers) as well as the resulting Sales figures we could train a model to use this information to predict expected Sales figures under various future advertising scenarios. Much of Machine Learning theory centres around data preparation, data sampling techniques, tuning algorithms as well as best practices for training processes to ensure best generalisation and statistical validity of results. The idea was to get computers to simulate this process to build a new kind of machine learning approach: Artificial Neural Networks. It would not be until the early 2000s that the birth of the cloud created a springboard that would catapult Artificial Neural Network research out of its winter and into the realm of Deep Learning.

PyTorch or TensorFlow?


PyTorch is essentially a GPU enabled drop-in replacement for NumPy equipped with higher-level functionality for building and training deep neural networks. In PyTorch the graph construction is dynamic, meaning the graph is built at run-time. TensorFlow does have the dynamic_rnn for the more common constructs but creating custom dynamic computations is more difficult. I haven't found the tools for data loading in TensorFlow (readers, queues, queue runners, etc.)

Microsoft unveils Project Brainwave for real-time AI - Microsoft Research


First, we have defined highly customized, narrow-precision data types that increase performance without real losses in model accuracy. Third, Project Brainwave incorporates a software stack designed to support the wide range of popular deep learning frameworks. Companies and researchers building DNN accelerators often show performance demos using convolutional neural networks (CNNs). Running on Stratix 10, Project Brainwave thus achieves unprecedented levels of demonstrated real-time AI performance on extremely challenging models.

Businesses Turn to AI, Digital Platforms for Competitive Edge


Gartner identified three technology trends it predicts will dominate the enterprise space in the coming years. The trends, or "megatrends," outlined in the recently released Hype Cycle for Emerging Technologies, 2017 (fee charged), will gain prominence and develop over the coming years. In this latest cycle, the company sees digital technologies entering the "Peak of Inflated Expectations Phase," and predicts the use of these technologies in the enterprises is set to explode. Organizations offering IoT/connected technologies reported a 50 percent increase in customer satisfaction, while 44 percent of organizations offering IoT/connected technologies reported satisfaction remaining the same.

AI visionary who teaches humans to teach computers

Daily Mail

Scientist Andrew Ng, right, works with others at his office in Palo Alto, Calif. Ng, one of the world's most renowned researchers in machine learning and artificial intelligence, is facing a dilemma: there aren't enough experts trained to train the machines. He has said he sees AI changing virtually every industry, and any task that takes less than a second of thought will eventually be done by machines. Andrew Ng poses at his office in Palo Alto, Calif. Ng, one of the world's most renowned researchers in machine learning and artificial intelligence, is facing a dilemma: there aren't enough experts trained to train the machines. More recently, he left his high-profile job at Baidu to launch Every time he's started something big, whether it's Coursera, the Google Brain deep learning unit, or Baidu's AI lab, he has left once he felt the teams he has built can carry on without him.

An Intuitive Guide to Deep Network Architectures – Towards Data Science – Medium


However, because the gradient signal in ResNets could travel back directly to early layers via shortcut connections, we could suddenly build 50-layer, 101-layer, 152-layer, and even (apparently) 1000 layer nets that still performed well. In a traditional conv net, each layer extracts information from the previous layer in order to transform the input data into a more useful representation. An Inception module computes multiple different transformations over the same input map in parallel, concatenating their results into a single output. One additional filter means convolving over M more maps; N additional filters means convolving over N*M more maps.

How a new wave of machine learning will impact today's enterprise 7wData


Advances in deep learning and other Machine Learning algorithms are currently causing a tectonic shift in the technology landscape. Betting big on an AI future, cloud providers are investing resources to simplify and promote machine learning to win new cloud customers. First, advances in computing technology (GPU chips and cloud computing, in particular) are enabling engineers to solve problems in ways that weren't possible before. For example, chipmaker NVIDIA has been ramping up production of GPU processors designed specifically to accelerate machine learning, and cloud providers like Microsoft and Google have been using them in their machine learning services.



Last year, Microsoft's speech and dialog research group announced a milestone in reaching human parity on the Switchboard conversational speech recognition task, meaning we had created technology that recognized words in a conversation as well as professional human transcribers. After our transcription system reached the 5.9 percent word error rate that we had measured for humans, other researchers conducted their own study, employing a more involved multi-transcriber process, which yielded a 5.1 human parity word error rate. Today, I'm excited to announce that our research team reached that 5.1 percent error rate with our speech recognition system, a new industry milestone, substantially surpassing the accuracy we achieved last year. While achieving a 5.1 percent word error rate on the Switchboard speech recognition task is a significant achievement, the speech research community still has many challenges to address, such as achieving human levels of recognition in noisy environments with distant microphones, in recognizing accented speech, or speaking styles and languages for which only limited training data is available.

How AI is paving the way for fully autonomous cars


In order to decipher these complex situations, autonomous vehicle developers are turning to artificial neural networks. In place of traditional programming, the network is given a set of inputs and a target output (in this case, the inputs being image data and the output being a particular class of object). The process of training a neural network for semantic segmentation involves feeding it numerous sets of training data with labels to identify key elements, such as cars or pedestrians. Machine learning is already employed for semantic segmentation in driver assistance systems, such as autonomous emergency braking, though.