Goto

Collaborating Authors

 Instructional Material


Webinar: Manufacturing and Artificial Intelligence: How Computer Vision Drives ROI

#artificialintelligence

Manufacturing enterprises are quickly deploying AI solutions to stay ahead, but how to do scale these advances -- and where to begin -- remain elusive. This talk, moderated by Levatas' head of Data Science, will walk through how to perform human-in-the-loop analysis of unstructured data such as imagery and video footage, and how it could save businesses time and money. Using real examples in NLP and computer vision from other industries, you'll see how it could be possible for your firm to take advantage of these cost-saving technologies in the near-future. We'll walk through what's needed and what kind of results other industries are seeing and what the potential is for this industry. Daniel is an avid technology enthusiast with 15 years of experience designing and architecting software applications.


On Training Recurrent Neural Networks for Lifelong Learning

arXiv.org Artificial Intelligence

Lifelong Machine Learning considers systems that can learn many tasks (from one or more domains) over a lifetime (Thrun, 1998; Silver et al., 2013). This has several names and manifestations in the literature: incremental learning (Solomonoff, 1989), continual learning (Ring, 1997), explanation-based learning (Thrun, 1996, 2012), never ending learning (Carlson et al., 2010), etc. The underlying idea motivating these efforts is the following: Lifelong learning systems would be more effective at learning and retaining knowledge across different tasks. By using the prior knowledge and exploiting similarity acrosstasks, they would be able to obtain better priors for the task at hand. Lifelong learning techniques are very important for training intelligent autonomous agents that would need to operate and make decisions over extended periods of time. These characteristics arespecially important in the industrial setups where the deployed machine learning models are being updated frequently with new incoming data whose distribution neednot match the data on which the model was originally trained. Lifelong learning is an extremely challenging task for the machine learning models because of two primary reasons: 1. Catastrophic Forgetting: As the model is trained on a new task (or a new data/task distribution), it is likely to forget the knowledge it acquired from the previous tasks (or data distributions). This phenomenon is also known as the catastrophic interference (McCloskey and Cohen, 1989).


The Barbados 2018 List of Open Issues in Continual Learning

arXiv.org Artificial Intelligence

We want to make progress toward artificial general intelligence, namely general-purpose agents that autonomously learn how to competently act in complex environments. The purpose of this report is to sketch a research outline, share some of the most important open issues we are facing, and stimulate further discussion in the community. The content is based on some of our discussions during a weeklong workshop held in Barbados in February 2018. We adopt the reinforcement learning (RL) formulation, where an agent interacts sequentially with an environment, and the agent is provided a reward signal that unambiguously defines success. We want to explicitly consider some of the most challenging dimensions for a developing intelligence.


Introduction to PyTorch for Deep Learning

#artificialintelligence

In this tutorial, you'll get an introduction to deep learning using the PyTorch framework, and by its conclusion, you'll be comfortable applying it to your deep learning models. Facebook launched PyTorch 1.0 early this year with integrations for Google Cloud, AWS, and Azure Machine Learning. In this tutorial, I assume that you're already familiar with Scikit-learn, Pandas, NumPy, and SciPy. These packages are important prerequisites for this tutorial. Deep learning is a subfield of machine learning with algorithms inspired by the working of the human brain.


Knowledge Tracing Machines: Factorization Machines for Knowledge Tracing

arXiv.org Artificial Intelligence

Knowledge tracing is a sequence prediction problem where the goal is to predict the outcomes of students over questions as they are interacting with a learning platform. By tracking the evolution of the knowledge of some student, one can optimize instruction. Existing methods are either based on temporal latent variable models, or factor analysis with temporal features. We here show that factorization machines (FMs), a model for regression or classification, encompasses several existing models in the educational literature as special cases, notably additive factor model, performance factor model, and multidimensional item response theory. We show, using several real datasets of tens of thousands of users and items, that FMs can estimate student knowledge accurately and fast even when student data is sparsely observed, and handle side information such as multiple knowledge components and number of attempts at item or skill level. Our approach allows to fit student models of higher dimension than existing models, and provides a testbed to try new combinations of features in order to improve existing models.


Andrew Ng launches 'AI for Everyone,' a new Coursera program aimed at business professionals

#artificialintelligence

Andrew Ng, a computer scientist who led Google's AI division, Google Brain, and formerly served as vice president and chief scientist at Baidu, is a veritable celebrity in the artificial intelligence (AI) industry. After leaving Baidu, he debuted an online curriculum of classes centered around machine learning -- Deeplearning.ai Ng was the keynote speaker at the AI Frontiers Conference in November 2017, and this year unveiled the AI Fund, a $175 million incubator that backs small teams of experts looking to solve key problems using machine learning. Oh, and he's also chairman of AI cognitive behavioral therapy startup Woebot; sits on the board of driverless car company Drive.ai; Yet somehow, he found time to put together a new online training course -- "AI for Everyone" -- that seeks to demystify AI for business executives.


6 Data Science & ML Books Every Data Scientist Should Keep Nearby

#artificialintelligence

The best way to stay in touch is to continue brushing up on your knowledge about data science while also maintaining experience. It's the perfect storm or combination of skills to help you succeed in the industry. Machine learning and data science are a complicated and involved set of interconnected concepts. To keep up, you need to be prepared to spend time doing research and brushing up on knowledge. Even working in the industry day in and day out, there are still ways to fall out of touch with the current trends.



How to Speed Up Deep Learning Inference Using TensorRT NVIDIA Developer Blog

#artificialintelligence

Welcome to this introduction to TensorRT, our platform for deep learning inference. You will learn how to deploy a deep learning application onto a GPU, increasing throughput and reducing latency during inference. TensorRT provides APIs and parsers to import trained models from all major deep learning frameworks. It then generates optimized runtime engines deployable in the datacenter as well as in automotive and embedded environments. Applications deployed on GPUs with TensorRT perform up to 40x faster than CPU-only platforms. This tutorial uses a C example to walk you through importing an ONNX model into TensorRT, applying optimizations, and generating a high-performance runtime engine for the datacenter environment.


Modular Networks: Learning to Decompose Neural Computation

arXiv.org Artificial Intelligence

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number of parameters with a relatively small increase in resources. We propose a training algorithm that flexibly chooses neural modules based on the data to be processed. Both the decomposition and modules are learned end-to-end. In contrast to existing approaches, training does not rely on regularization to enforce diversity in module use. We apply modular networks both to image recognition and language modeling tasks, where we achieve superior performance compared to several baselines. Introspection reveals that modules specialize in interpretable contexts.