Overview
A Retrospective on Mutual Bootstrapping
Riloff, Ellen (University of Utah) | Jones, Rosie (Microsoft)
When we were invited to write a retrospective article about our AAAI-99 paper on mutual bootstrapping (Riloff and Jones 1999), our first reaction was hesitation because, well, that algorithm seems old and clunky now. But upon reflection, it shaped a great deal of subsequent work on bootstrapped learning for natural language processing, both by ourselves and others. So our second reaction was enthusiasm, for the opportunity to think about the path from 1999 to 2017 and to share the lessons that we learned about bootstrapped learning along the way. This article begins with a brief history of related research that preceded and inspired the mutual bootstrapping work, to position it with respect to that period of time. We then describe the general ideas and approach behind the mutual bootstrapping algorithm. Next, we overview several types of research that have followed and shared similar themes: multi-view learning, bootstrapped lexicon induction, and bootstrapped pattern learning. Finally, we discuss some of the general lessons that we have learned about bootstrapping techniques for NLP to offer guidance to researchers and practitioners who may be interested in exploring these types of techniques in their own work.
Japan's brokerages joining to adopt blockchain- Nikkei Asian Review
Japanese brokerages are launching a consortium dedicated to driving the adoption of blockchain and other innovative technologies in the industry, with the goal of boosting efficiency and providing more convenient services to their customers. The group's 18 founding members include online players like SBI Securities and Rakuten Securities, as well as conventional brokerages like Nomura Securities and Daiwa Securities. SBI Holdings unit SBI Ripple Asia will be the lead organizer for the group. It will hold one or two working meetings a month, as well as carry out proof-of-concept tests on cutting-edge technologies. The group aims to cut costs through industry-wide cooperation. Specifically, the consortium is looking into a shared log-in mechanism for brokerage accounts for customers that uses biometrics and other personal identification information, as well as using artificial intelligence to screen trading activity.
Variational Inference: A Review for Statisticians
Blei, David M., Kucukelbir, Alp, McAuliffe, Jon D.
One of the core problems of modern statistics is to approximate difficult-to-compute probability densities. This problem is especially important in Bayesian statistics, which frames all inference about unknown quantities as a calculation involving the posterior density. In this paper, we review variational inference (VI), a method from machine learning that approximates probability densities through optimization. VI has been used in many applications and tends to be faster than classical methods, such as Markov chain Monte Carlo sampling. The idea behind VI is to first posit a family of densities and then to find the member of that family which is close to the target. Closeness is measured by Kullback-Leibler divergence. We review the ideas behind mean-field variational inference, discuss the special case of VI applied to exponential family models, present a full example with a Bayesian mixture of Gaussians, and derive a variant that uses stochastic optimization to scale up to massive data. We discuss modern research in VI and highlight important open problems. VI is powerful, but it is not yet well understood. Our hope in writing this paper is to catalyze statistical research on this class of algorithms.
Unsupervised Domain Adaptation: from Simulation Engine to the RealWorld
Zhao, Sicheng, Wu, Bichen, Gonzalez, Joseph, Seshia, Sanjit A., Keutzer, Kurt
Large-scale labeled training datasets have enabled deep neural networks to excel on a wide range of benchmark vision tasks. However, in many applications it is prohibitively expensive or time-consuming to obtain large quantities of labeled data. To cope with limited labeled training data, many have attempted to directly apply models trained on a large-scale labeled source domain to another sparsely labeled target domain. Unfortunately, direct transfer across domains often performs poorly due to domain shift and dataset bias. Domain adaptation is the machine learning paradigm that aims to learn a model from a source domain that can perform well on a different (but related) target domain. In this paper, we summarize and compare the latest unsupervised domain adaptation methods in computer vision applications. We classify the non-deep approaches into sample re-weighting and intermediate subspace transformation categories, while the deep strategy includes discrepancy-based methods, adversarial generative models, adversarial discriminative models and reconstruction-based methods. We also discuss some potential directions.
A high-bias, low-variance introduction to Machine Learning for physicists
Mehta, Pankaj, Bukov, Marin, Wang, Ching-Hao, Day, Alexandre G. R., Richardson, Clint, Fisher, Charles K., Schwab, David J.
Machine Learning (ML) is one of the most exciting and dynamic areas of modern research and application. The purpose of this review is to provide an introduction to the core concepts and tools of machine learning in a manner easily understood and intuitive to physicists. The review begins by covering fundamental concepts in ML and modern statistics such as the bias-variance tradeoff, overfitting, regularization, and generalization before moving on to more advanced topics in both supervised and unsupervised learning. Topics covered in the review include ensemble models, deep learning and neural networks, clustering and data visualization, energy-based models (including MaxEnt models and Restricted Boltzmann Machines), and variational methods. Throughout, we emphasize the many natural connections between ML and statistical physics. A notable aspect of the review is the use of Python notebooks to introduce modern ML/statistical packages to readers using physics-inspired datasets (the Ising Model and Monte-Carlo simulations of supersymmetric decays of proton-proton collisions). We conclude with an extended outlook discussing possible uses of machine learning for furthering our understanding of the physical world as well as open problems in ML where physicists maybe able to contribute. (Notebooks are available at https://physics.bu.edu/~pankajm/MLnotebooks.html )
A Primer for Artificial Intelligence and Machine Learning
According to many thought leaders, analysts and early adopters, Artificial Intelligence (AI) and Machine Learning (ML) technologies are rapidly making their way into every industry, geography, system and process. This means that B2B sales and marketers need a primer for artificial intelligence and machine learning to quickly catch up on how they can benefit. This primer on AI and ML will explain how. In general, artificial intelligence is the simulation of human intelligence processes by machines. These processes include learning (the acquisition of information and rules for using the information), reasoning (using the rules to reach approximate or definite conclusions), and self-correction (continuous and tireless learning).
Amazon Machine Learning and Analytics Tools – BMC Blogs
Here we begin our survey of Amazon AWS cloud analytics and big data tools. First we will give an overview of some of what is available. Then we will look at some of them in more detail in subsequent blog posts and provide examples of how to use them. Amazon's approach to selling these cloud services is that these tools take some of the complexity out of developing ML predictive, classification models and neural networks. That is true, but could it be limiting.
Disrupt4.0- Webinar on Deep Learning: Multi-layer ANNs
WEBINAR DESCRIPTION This 1 hour session will provide an overview on Multi-layer Artificial Neural Networks (ANNs). Artificial Neural Networks (ANNs) are the building blocks of modern Deep Learning applications, such as image processing, speech recognition, text analytics, driverless cars etc. This session will cover the basics of multi-layer ANNs, discuss forward propagation and backpropagation logic, cost function in a multi-layer ANN and how to achieve convergence. This session will also include demonstration of small python programs which implement such Multi-Layer ANNs. WARNING:- This is an advanced Deep Learning topic.
Introduction to Machine Learning - Algorithmia Blog
Machine Learning is about making predictions. This post will give an introduction to Machine Learning through a problem that most businesses face: predicting customer churn. ML can help predict which of your customers are at risk for leaving in advance, and give you an edge by pre-empting with action. Machine Learning can best be understood through four progressive lenses. Machine Learning allows us to accurately predict things using simple statistical methods, algorithms, and modern computing power.