Instructional Material
Boston Data Education Meetup
Our First meetup will be focused on H2O, an open-source Machine Learning Platform. It will take place the CIC on Milk Street in Boston.. the Date is set for 5/12 at 6:00PM. NOTE: Bring Laptops if you want to partake in hands on demo. This workshop will provide an overview of how to use H2O, the scalable open source machine learning library, from Python/R/Flow UI. The core algorithms of H2O are implemented in Java, however, fully-featured APIs are available in R, Python, Scala, and also through the Flow UI web interface.
Theano Tutorial - Marek Rei
This is an introductory tutorial on using Theano, the Python library. I'm going to start from scratch and assume no previous knowledge of Theano. However, understanding how neural networks work will be useful when getting to the code examples towards the end. I recently gave this tutorial as a talk in University of Cambridge and it turned out to be way more popular than expected. In order to give more people access to the material, I'm now writing it up as a blog post. I do not claim to know everything about Theano, and I constantly learn new things myself.
Support Vector Machines for Machine Learning - Machine Learning Mastery
Support Vector Machines are perhaps one of the most popular and talked about machine learning algorithms. They were extremely popular around the time they were developed in the 1990s and continue to be the go-to method for a high-performing algorithm with little tuning. In this post you will discover the Support Vector Machine (SVM) machine learning algorithm. SVM is an exciting algorithm and the concepts are relatively simple. This post was written for developers with little or no background in statistics and linear algebra.
The Discovery of Machine Learning
"Any sufficiently advanced technology is indistinguishable from magic" โ Arthur Clarke In the space of technology, this quote has never been so true. Daily discoveries are not uncommon, with yearly revolutions seldom missed. Machine learning has recently been exploding in popularity, with everyone rushing to see how it can benefit their lives, but how did we get to this stage of innovation? This discovery process of machine learning is what you will learn from this article. I was surprised by how much I learned researching machine learning's upbringing, there is enough drama and action to write a novel about it.
From the Lab to the Classroom and Beyond: Extending a Game-Based Research Platform for Teaching AI to Diverse Audiences
Sintov, Nicole (University of Southern California) | Kar, Debarun (University of Southern California) | Nguyen, Thanh (University of Southern California) | Fang, Fei (University of Southern California) | Hoffman, Kevin (Aspire Public Schools) | Lyet, Arnaud (World Wildlife Fund) | Tambe, Milind (University of Southern California)
Recent years have seen increasing interest in AI from outside the AI community. This is partly due to applications based on AI that have been used in real-world domains, for example, the successful deployment of game theory-based decision aids in security domains. This paper describes our teaching approach for introducing the AI concepts underlying security games to diverse audiences. We adapted a game-based research platform that served as a testbed for recent research advances in computational game theory into a set of interactive role-playing games. We guided learners in playing these games as part of our teaching strategy, which also included didactic instruction and interactive exercises on broader AI topics. We describe our experience in applying this teaching approach to diverse audiences, including students of an urban public high school, university undergraduates, and security domain experts who protect wildlife. We evaluate our approach based on results from the games and participant surveys.
A Probabilistic Approach to Knowledge Translation
Jiang, Shangpu (University of Oregon) | Lowd, Daniel (University of Oregon) | Dou, Dejing (University of Oregon )
In this paper, we focus on a novel knowledge reuse scenario where the knowledge in the source schema needs to be translated to a semantically heterogeneous target schema. We refer to this task as โknowledge translationโ (KT). Unlike data translation and transfer learning, KT does not require any data from the source or target schema. We adopt a probabilistic approach to KT by representing the knowledge in the source schema, the mapping between the source and target schemas, and the resulting knowledge in the target schema all as probability distributions, specially using Markov random fields and Markov logic networks. Given the source knowledge and mappings, we use standard learning and inference algorithms for probabilistic graphical models to find an explicit probability distribution in the target schema that minimizes the Kullback-Leibler divergence from the implicit distribution. This gives us a compact probabilistic model that represents knowledge from the source schema as well as possible, respecting the uncertainty in both the source knowledge and the mapping. In experiments on both propositional and relational domains, we find that the knowledge obtained by KT is comparable to other approaches that require data, demonstrating that knowledge can be reused without data.
Incremental Stochastic Factorization for Online Reinforcement Learning
Barreto, Andre M. S. (Laboratรณrio Nacional de Computaรงรฃo Cientรญfica) | Beirigo, Rafael L. (Laboratรณrio Nacional de Computaรงรฃo Cientรญfica) | Pineau, Joelle (McGill University) | Precup, Doina (McGill University)
A construct that has been receiving attention recently in reinforcement learning is stochastic factorization (SF), a particular case of non-negative factorization (NMF) in which the matrices involved are stochastic. The idea is to use SF to approximate the transition matrices of a Markov decision process (MDP). This is useful for two reasons. First, learning the factors of the SF instead of the transition matrices can reduce significantly the number of parameters to be estimated. Second, it has been shown that SF can be used to reduce the number of operations needed to compute an MDP's value function. Recently, an algorithm called expectation-maximization SF (EMSF) has been proposed to compute a SF directly from transitions sampled from an MDP. In this paper we take a closer look at EMSF. First, by exploiting the assumptions underlying the algorithm, we show that it is possible to reduce it to simple multiplicative update rules similar to the ones that helped popularize NMF. Second, we analyze the optimization process underlying EMSF and find that it minimizes a modified version of the Kullback-Leibler divergence that is particularly well-suited for learning a SF from data sampled from an arbitrary distribution. Third, we build on this improved understanding of EMSF to draw an interesting connection with NMF and probabilistic latent semantic analysis. We also exploit the simplified update rules to introduce a new version of EMSF that generalizes and significantly improves its precursor. This new algorithm provides a practical mechanism to control the trade-off between memory usage and computing time, essentially freeing the space complexity of EMSF from its dependency on the number of sample transitions. The algorithm can also compute its approximation incrementally, which makes it possible to use it concomitantly with the collection of data. This feature makes the new version of EMSF particularly suitable for online reinforcement learning. Empirical results support the utility of the proposed algorithm.
BBookX: Building Online Open Books for Personalized Learning
Liang, Chen (Pennsylvania State University) | Wang, Shuting (Pennsylvania State University) | Wu, Zhaohui (Pennsylvania State University) | Williams, Kyle (Pennsylvania State University) | Pursel, Bart (Pennsylvania State University) | Brautigam, Benjamin (Pennsylvania State University) | Saul, Sherwyn (Pennsylvania State University) | Williams, Hannah (Pennsylvania State University) | Bowen, Kyle (Pennsylvania State University) | Giles, C. Lee (Pennsylvania State University)
We demonstrate BBookX, a novel system that auto-matically builds in collaboration with a user online openbooks by searching open educational resources (OER).This system explores the use of retrieval technologies todynamically generate zero-cost materials such as text-books for personalized learning.