Education
Glass-Box Program Synthesis: A Machine Learning Approach
Christakopoulou, Konstantina (University of Minnesota, Twin Cities) | Kalai, Adam Tauman (Microsoft Research, New England )
Recently proposed models which learn to write computer programs from data use either input/output examples or rich execution traces. Instead, we argue that a novel alternative is to use a glass-box scoring function, given as a program itself that can be directly inspected. Glass-box optimization covers a wide range of problems, from computing the greatest common divisor of two integers, to learning-to-learn problems. In this paper, we present an intelligent search system which learns, given the partial program and the glass-box problem, the probabilities over the space of programs. We empirically demonstrate that our informed search procedure leads to significant improvements compared to brute-force program search, both in terms of accuracy and time. For our experiments we use rich context free grammars inspired by number theory, text processing, and algebra. Our results show that (i) running our framework iteratively can considerably increase the number of problems solved, (ii) our framework can improve itself even in domain agnostic scenarios, and (iii) it can solve problems that would be otherwise too slow to solve with brute-force search.
Thinking in PolAR Pictures: Using Rotation-Friendly Mental Images to Solve Leiter-R Form Completion
Palmer, Joshua H. (Vanderbilt University) | Kunda, Maithilee (Vanderbilt University)
The Leiter International Performance Scale-Revised (Leiter-R) is a standardized cognitive test that seeks to "provide a nonverbal measure of general intelligence by sampling a wide variety of functions from memory to nonverbal reasoning." Understanding the computational building blocks of nonverbal cognition, as measured by the Leiter-R, is an important step towards understanding human nonverbal cognition, especially with respect to typical and atypical trajectories of child development. One subtest of the Leiter-R, Form Completion, involves synthesizing and localizing a visual figure from its constituent slices. Form Completion poses an interesting nonverbal problem that seems to combine several aspects of visual memory, mental rotation, and visual search. We describe a new computational cognitive model that addresses Form Completion using a novel, mental-rotation-friendly image representation that we call the Polar Augmented Resolution (PolAR) Picture, which enables high-fidelity mental rotation operations. We present preliminary results using actual Leiter-R test items and discuss directions for future work.
A Unified Model for Document-Based Question Answering Based on Human-Like Reading Strategy
Li, Weikang (Peking University) | Li, Wei (Peking University) | Wu, Yunfang (Peking University)
Document-based Question Answering (DBQA) in Natural Language Processing (NLP) is important but difficult because of the long document and the complex question. Most of previous deep learning methods mainly focus on the similarity computation between two sentences. However, DBQA stems from the reading comprehension in some degree, which is originally used to train and test people's ability of reading and logical thinking. Inspired by the strategy of doing reading comprehension tests, we propose a unified model based on the human-like reading strategy. The unified model contains three major encoding layers that are consistent to different steps of the reading strategy, including the basic encoder, combined encoder and hierarchical encoder. We conduct extensive experiments on both the English WikiQA dataset and the Chinese dataset, and the experimental results show that our unified model is effective and yields state-of-the-art results on WikiQA dataset.
Learning Nonlinear Dynamics in Efficient, Balanced Spiking Networks Using Local Plasticity Rules
Alemi, Alireza (ENS and UC Davis) | Machens, Christian K. (Champalimaud Centre for the Unknown) | Deneve, Sophie (Ecole Normale Superieure, Paris) | Slotine, Jean-Jacques (MIT)
The brain uses spikes in neural circuits to perform many dynamical computations. The computations are performed with properties such as spiking efficiency, i.e. minimal number of spikes, and robustness to noise. A major obstacle for learning computations in artificial spiking neural networks with such desired biological properties is due to lack of our understanding of how biological spiking neural networks learn computations. Here, we consider the credit assignment problem, i.e. determining the local contribution of each synapse to the network's global output error, for learning nonlinear dynamical computations in a spiking network with the desired properties of biological networks. We approach this problem by fusing the theory of efficient, balanced neural networks (EBN) with nonlinear adaptive control theory to propose a local learning rule. Locality of learning rules are ensured by feeding back into the network its own error, resulting in a learning rule depending solely on presynaptic inputs and error feedbacks. The spiking efficiency and robustness of the network are guaranteed by maintaining a tight excitatory/inhibitory balance, ensuring that each spike represents a local projection of the global output error and minimizes a loss function. The resulting networks can learn to implement complex dynamics with very small numbers of neurons and spikes, exhibit the same spike train variability as observed experimentally, and are extremely robust to noise and neuronal loss.
On Validation and Predictability of Digital Badges’ Influence on Individual Users
Kuśmierczyk, Tomasz (Norwegian University of Science and Technology) | Nørvåg, Kjetil (Norwegian University of Science and Technology)
Badges are a common, and sometimes the only, method of incentivizing users to perform certain actions on on- line sites. However, due to many competing factors influencing user temporal dynamics, it is difficult to determine whether the badge had (or will have) the intended effect or not. In this paper, we introduce two complementary approaches for determining badge influence on users. In the first one, we cluster users’ temporal traces (represented with Poisson processes) and apply covariates (user features) to regularize results. In the second approach, we first classify users’ temporal traces with a novel statistical framework, and then we refine the classification results with a semi-supervised clustering of covariates. Outcomes obtained from an evaluation on synthetic datasets and experiments on two badges from a pop- ular Q&A platform confirm that it is possible to validate, characterize and to some extent predict users affected by the badge.
Generating an Event Timeline About Daily Activities From a Semantic Concept Stream
Miyanishi, Taiki (Advanced Telecommunications Research Institute International (ATR)) | Hirayama, Jun-ichiro (RIKEN Center for Advanced Intelligence Project (AIP)) | Maekawa, Takuya (Advanced Telecommunications Research Institute International (ATR)) | Kawanabe, Motoaki (Graduate School of Information Science and Technology, Osaka University)
Recognizing activities of daily living (ADLs) in the real world is an important task for understanding everyday human life. However, even though our life events consist of chronological ADLs with the corresponding places and objects (e.g., drinking coffee in the living room after making coffee in the kitchen and walking to the living room), most existing works focus on predicting individual activity labels from sensor data. In this paper, we introduce a novel framework that produces an event timeline of ADLs in a home environment. The proposed method combines semantic concepts such as action, object, and place detected by sensors for generating stereotypical event sequences with the following three real-world properties. First, we use temporal interactions among concepts to remove objects and places unrelated to each action. Second, we use commonsense knowledge mined from a language resource to find a possible combination of concepts in the real world. Third, we use temporal variations of events to filter repetitive events, since our daily life changes over time. We use cross-place validation to evaluate our proposed method on a daily-activities dataset with manually labeled event descriptions. The empirical evaluation demonstrates that our method using real-world properties improves the performance of generating an event timeline over diverse environments.
Optimization Methods for Large-Scale Machine Learning
Bottou, Léon, Curtis, Frank E., Nocedal, Jorge
This paper provides a review and commentary on the past, present, and future of numerical optimization algorithms in the context of machine learning applications. Through case studies on text classification and the training of deep neural networks, we discuss how optimization problems arise in machine learning and what makes them challenging. A major theme of our study is that large-scale machine learning represents a distinctive setting in which the stochastic gradient (SG) method has traditionally played a central role while conventional gradient-based nonlinear optimization techniques typically falter. Based on this viewpoint, we present a comprehensive theory of a straightforward, yet versatile SG algorithm, discuss its practical behavior, and highlight opportunities for designing algorithms with improved performance. This leads to a discussion about the next generation of optimization methods for large-scale machine learning, including an investigation of two main streams of research on techniques that diminish noise in the stochastic directions and methods that make use of second-order derivative approximations.
What 2018 Holds for the Future of Work
The digital workplace will fundamentally change the way we work. Established and emerging technologies play a big part in this, but our day to day work and the way organizations manage this work is undergoing a radical shift. Gartner VP Matthew W Cain and Gartner research director Helen Poitevin outlined 11 emerging trends that will shape digital workplaces in the years to come at the Gartner Digital Workplace Summit in London earlier this year, many of which held true. But as Cain and Poitevin noted, the digital workplace is not just about technologies. The following 10 trends explore the how the way we work will continue to change over the coming year.
KWHS Educator Toolkit: Artificial Intelligence
Movie buffs have been hearing about artificial intelligence for years – from Steven Spielberg's 2001 science fiction drama AI to the 2015 robotic police force in Chappie and beyond. AI is no longer the stuff of science fiction. This essential part of the technology sector aims to create intelligent machines of all kinds that think, work and react like humans. Just as electricity transformed the way industries functioned in the past century, artificial intelligence -- the science of programming cognitive abilities into machines -- has the power to substantially change society in the next 100 years. AI is being harnessed to enable such things as home robots, robo-taxis and mental health chatbots to make you feel better.
Become a Deep Learning Coder From Scratch in Under a Year
Machine learning (aka A.I.) seems bizarre and complicated. It's the tech behind image and speech recognition, recommendation systems, and all kinds of tasks that computers used to be really bad at but are now really good at. It involves teaching a computer to teach itself. And you can learn to do it in well under a year, according to data scientist Bargava. You'll need to put in a solid 10-20 hours a week, but you will learn a lot along the way.