Goto

Collaborating Authors

 Instructional Material


Educating the next generation of medical professionals with machine learning is essential

#artificialintelligence

"The general public has become quite aware of AI and the impact it can have on health care outcomes such as providing clinicians with improved diagnostics. However, if medical education does not begin to teach medical students about AI and how to apply it into patient care then the advancement of technology will be limited in use and its impact on patient care," explained corresponding author Vijaya B. Kolachalama, PhD, assistant professor of medicine at Boston University School of Medicine (BUSM). Using a PubMed search with'machine learning' as the medical subject heading term, the researchers found that the number of papers published in the area of ML has increased since the beginning of this decade. In contrast, the number of publications related to undergraduate and graduate medical education have remained relatively unchanged since 2010. Realizing the need for educating the students and trainees within the Boston University Medical Campus about ML, Kolachalama designed and taught an introductory course at BUSM.


Machine Learning School in Doha 2018 BigML.com

#artificialintelligence

BigML and the Qatar Computing Research Institute (QCRI), part of Hamad Bin Khalifa University, bring the first edition of our Machine Learning School to Doha, the MLSD18. This event will be one of the first activities to be hosted by QCRI's new Qatar Center for Artificial Intelligence (QCAI). QCAI's mission is to help Qatar realize its vision of becoming a knowledge-based economy by developing and promoting cutting-edge AI innovations for the betterment of human society. We will hold a two-day crash course ideal for business leaders, industry practitioners, developers, graduate students, as well as advanced undergraduates, seeking a quick, practical, and hands-on introduction to Machine Learning to solve real-world problems. The MLSD18 will help attendees to understand how to work in this new wave of innovation that is changing the face of all sectors of the economy.


Queue-based Resampling for Online Class Imbalance Learning

arXiv.org Machine Learning

Online class imbalance learning constitutes a new problem and an emerging research topic that focusses on the challenges of online learning under class imbalance and concept drift. Class imbalance deals with data streams that have very skewed distributions while concept drift deals with changes in the class imbalance status. Little work exists that addresses these challenges and in this paper we introduce queue-based resampling, a novel algorithm that successfully addresses the co-existence of class imbalance and concept drift. The central idea of the proposed resampling algorithm is to selectively include in the training set a subset of the examples that appeared in the past. Results on two popular benchmark datasets demonstrate the effectiveness of queue-based resampling over state-of-the-art methods in terms of learning speed and quality.


An Introduction to Probabilistic Programming

arXiv.org Artificial Intelligence

This document is designed to be a first-year graduate-level introduction to probabilistic programming. It not only provides a thorough background for anyone wishing to use a probabilistic programming system, but also introduces the techniques needed to design and build these systems. It is aimed at people who have an undergraduate-level understanding of either or, ideally, both probabilistic machine learning and programming languages. We start with a discussion of model-based reasoning and explain why conditioning as a foundational computation is central to the fields of probabilistic machine learning and artificial intelligence. We then introduce a simple first-order probabilistic programming language (PPL) whose programs define static-computation-graph, finite-variable-cardinality models. In the context of this restricted PPL we introduce fundamental inference algorithms and describe how they can be implemented in the context of models denoted by probabilistic programs. In the second part of this document, we introduce a higher-order probabilistic programming language, with a functionality analogous to that of established programming languages. This affords the opportunity to define models with dynamic computation graphs, at the cost of requiring inference methods that generate samples by repeatedly executing the program. Foundational inference algorithms for this kind of probabilistic programming language are explained in the context of an interface between program executions and an inference controller. This document closes with a chapter on advanced topics which we believe to be, at the time of writing, interesting directions for probabilistic programming research; directions that point towards a tight integration with deep neural network research and the development of systems for next-generation artificial intelligence applications.


Generative replay with feedback connections as a general strategy for continual learning

arXiv.org Artificial Intelligence

Standard artificial neural networks suffer from the well-known issue of catastrophic forgetting, making continual or lifelong learning problematic. Recently, numerous methods have been proposed for continual learning, but due to differences in evaluation protocols it is difficult to directly compare their performance. To enable more meaningful comparisons, we identified three distinct continual learning scenarios based on whether task identity is known and, if it is not, whether it needs to be inferred. Performing the split and permuted MNIST task protocols according to each of these scenarios, we found that regularization-based approaches (e.g., elastic weight consolidation) failed when task identity needed to be inferred. In contrast, generative replay combined with distillation (i.e., using class probabilities as "soft targets") achieved superior performance in all three scenarios. In addition, we reduced the computational cost of generative replay by integrating the generative model into the main model by equipping it with generative feedback connections. This Replay-through-Feedback approach substantially shortened training time with no or negligible loss in performance. We believe this to be an important first step towards making the powerful technique of generative replay scalable to real-world continual learning applications.


Screencast: Continuous Delivery for Machine Learning with AWS CodePipeline and Amazon SageMaker

#artificialintelligence

The Amazon SageMaker machine learning service is a full platform that greatly simplifies the process of training and deploying your models at scale. However, there are still major gaps to enabling data scientists to do research and development without having to go through the heavy lifting of provisioning the infrastructure and developing their own continuous delivery practices to obtain quick feedback. In this talk, you will learn how to leverage AWS CodePipeline, CloudFormation, CodeBuild, and SageMaker to create continuous delivery pipelines that allow the data scientist to use a repeatable process to build, train, test and deploy their models. Below, I've included a screencast of the talk I gave at the AWS NYC Summit in July 2018 along with a transcript (generated by Amazon Transcribe – another Machine Learning service – along with lots of human editing). The last six minutes of the talk include two demos on using SageMaker, CodePipeline, and CloudFormation as part of the open source solution we created.


AIML

#artificialintelligence

Those who learn how to make machines that exhibit intelligence today are tomorrow going to lead the next technological revolution, be part of the most cutting-edge companies and stand a chance to disrupt almost all industries through their skillsets.


Deep Learning Models for Human Activity Recognition

#artificialintelligence

Human activity recognition, or HAR, is a challenging time series classification task. It involves predicting the movement of a person based on sensor data and traditionally involves deep domain expertise and methods from signal processing to correctly engineer features from the raw data in order to fit a machine learning model. Recently, deep learning methods such as convolutional neural networks and recurrent neural networks have shown capable and even achieve state-of-the-art results by automatically learning features from the raw sensor data. In this post, you will discover the problem of human activity recognition and the deep learning methods that are achieving state-of-the-art performance on this problem. Deep Learning Models for Human Activity Recognition Photo by Simon Harrod, some rights reserved. Human activity recognition, or HAR for short, is a broad field of study concerned with identifying the specific movement or action of a person based on sensor data. Movements are often typical activities performed indoors, such as walking, talking, standing, and sitting.


How to Optimise Ad CTR with Reinforcement Learning Codementor

#artificialintelligence

In this blog we will try to get the basic idea behind reinforcement learning and understand what is a multi arm bandit problem. We will also be trying to maximise CTR(click through rate) for advertisements for a advertising agency. Article includes: 1. Basics of reinforcement learning 2. Types of problems in reinforcement learning 3. Understamding multi-arm bandit problem 4. Basics of conditional probability and Thompson sampling 5. Optimizing ads CTR using Thompson sampling in R Reinforcement Learning Basics Reinforcement learning refers to goal-oriented algorithms, which learn how to attain a complex objective (goal) or maximise along a particular dimension over many steps; for example, maximise the points won in a game over many moves. They can start from a blank slate, and under the right conditions, they achieve superhuman performance. Like a child incentivized by spankings and candy, these algorithms are penalized when they make the wrong decisions and rewarded when they make the right ones -- this is reinforcement.


Personalized Education at Scale

arXiv.org Machine Learning

Tailoring the presentation of information to the needs of individual students leads to massive gains in student outcomes (Bloom 1984). This finding is likely due to the fact that different students learn differently, perhaps as a result of variation in ability, interest or other factors (Schiefele, Krapp, and Winteler 1992). Adapting presentations to the educational needs of an individual has traditionally been the domain of experts, making it expensive and logistically challenging to do at scale, and also leading to inequity in educational outcomes. Increased course sizes and large MOOC enrollments provide an unprecedented access to student data. We propose that emerging technologies in reinforcement learning (RL), as well as semi-supervised learning, natural language processing, and computer vision are critical to leveraging this data to provide personalized education at scale.