Education
Special Track on Intelligent Tutoring Systems
Roscoe, Rod (Arizona State University) | Segedy, James (Vanderbilt University)
In general, the goal of the track is to bring together an international group of scientists to present current research, design, and empirical evaluations of their tutoring systems. This track is meant to inform researchers on the recent developments in both the design and evaluation of tutoring sys- tems.
A Natural Language Conversational System for Online Academic Advising
Latorre-Navarro, Edward M. (University of Florida) | Harris, John G. (University of Florida)
We have designed an academic advising online system to advise college students using natural language conversations. The system embeds knowledge of current and future teaching schedules, degree requirements, course prerequisites and various administrative procedures. While this information can be found by searching several university websites and catalogs, students continually ask human advisors these questions during their limited face-to-face time, which limits deeper developmental and educative advising that is only available from human advisors. Our system enhances the advising experience by offering a source for instant academic advice that does not require student training or additional human resources. The system contains a pattern-matching dialog management system with access via a web browser. We describe the motivation for our system, the design requisites, our approach for deployment, and analyze results from real-world field tests.
Perceptron-like Algorithms and Generalization Bounds for Learning to Rank
Chaudhuri, Sougata, Tewari, Ambuj
Learning to rank is a supervised learning problem where the output space is the space of rankings but the supervision space is the space of relevance scores. We make theoretical contributions to the learning to rank problem both in the online and batch settings. First, we propose a perceptron-like algorithm for learning a ranking function in an online setting. Our algorithm is an extension of the classic perceptron algorithm for the classification problem. Second, in the setting of batch learning, we introduce a sufficient condition for convex ranking surrogates to ensure a generalization bound that is independent of number of objects per query. Our bound holds when linear ranking functions are used: a common practice in many learning to rank algorithms. En route to developing the online algorithm and generalization bound, we propose a novel family of listwise large margin ranking surrogates. Our novel surrogate family is obtained by modifying a well-known pairwise large margin ranking surrogate and is distinct from the listwise large margin surrogates developed using the structured prediction framework. Using the proposed family, we provide a guaranteed upper bound on the cumulative NDCG (or MAP) induced loss under the perceptron-like algorithm. We also show that the novel surrogates satisfy the generalization bound condition.
Cover Tree Bayesian Reinforcement Learning
Tziortziotis, Nikolaos, Dimitrakakis, Christos, Blekas, Konstantinos
This paper proposes an online tree-based Bayesian approach for reinforcement learning. For inference, we employ a generalised context tree model. This defines a distribution on multivariate Gaussian piecewise-linear models, which can be updated in closed form. The tree structure itself is constructed using the cover tree method, which remains efficient in high dimensional spaces. We combine the model with Thompson sampling and approximate dynamic programming to obtain effective exploration policies in unknown environments. The flexibility and computational simplicity of the model render it suitable for many reinforcement learning problems in continuous state spaces. We demonstrate this in an experimental comparison with a Gaussian process model, a linear model and simple least squares policy iteration.
Robot Learning from Human Teachers
Chernova, Sonia, Thomaz, Andrea L.
Learning from Demonstration (LfD) explores techniques for learning a task policy from examples provided by a human teacher. The field of LfD has grown into an extensive body of literature over the past 30 years, with a wide variety of approaches for encoding human demonstrations and modeling skills and tasks. Additionally, we have recently seen a focus on gathering data from non-expert human teachers (i.e., domain experts but not robotics experts). In this book, we provide an introduction to the field with a focus on the unique technical challenges associated with designing robots that learn from naive human teachers. We begin, in the introduction, with a unification of the various terminology seen in the literature as well as an outline of the design choices one has in designing an LfD system.
Model Based Clustering of High-Dimensional Binary Data
Tang, Yang, Browne, Ryan P., McNicholas, Paul D.
We propose a mixture of latent trait models with common slope parameters (MCLT) for model-based clustering of high-dimensional binary data, a data type for which few established methods exist. Recent work on clustering of binary data, based on a $d$-dimensional Gaussian latent variable, is extended by incorporating common factor analyzers. Accordingly, our approach facilitates a low-dimensional visual representation of the clusters. We extend the model further by the incorporation of random block effects. The dependencies in each block are taken into account through block-specific parameters that are considered to be random variables. A variational approximation to the likelihood is exploited to derive a fast algorithm for determining the model parameters. Our approach is demonstrated on real and simulated data.
Statistical Decision Making for Optimal Budget Allocation in Crowd Labeling
Chen, Xi, Lin, Qihang, Zhou, Dengyong
In crowd labeling, a large amount of unlabeled data instances are outsourced to a crowd of workers. Workers will be paid for each label they provide, but the labeling requester usually has only a limited amount of the budget. Since data instances have different levels of labeling difficulty and workers have different reliability, it is desirable to have an optimal policy to allocate the budget among all instance-worker pairs such that the overall labeling accuracy is maximized. We consider categorical labeling tasks and formulate the budget allocation problem as a Bayesian Markov decision process (MDP), which simultaneously conducts learning and decision making. Using the dynamic programming (DP) recurrence, one can obtain the optimal allocation policy. However, DP quickly becomes computationally intractable when the size of the problem increases. To solve this challenge, we propose a computationally efficient approximate policy, called optimistic knowledge gradient policy. Our MDP is a quite general framework, which applies to both pull crowdsourcing marketplaces with homogeneous workers and push marketplaces with heterogeneous workers. It can also incorporate the contextual information of instances when they are available. The experiments on both simulated and real data show that the proposed policy achieves a higher labeling accuracy than other existing policies at the same budget level.
Algorithms and Applications for the Same-Decision Probability
Chen, S. J., Choi, A., Darwiche, A.
When making decisions under uncertainty, the optimal choices are often difficult to discern, especially if not enough information has been gathered. Two key questions in this regard relate to whether one should stop the information gathering process and commit to a decision (stopping criterion), and if not, what information to gather next (selection criterion). In this paper, we show that the recently introduced notion, Same-Decision Probability (SDP), can be useful as both a stopping and a selection criterion, as it can provide additional insight and allow for robust decision making in a variety of scenarios. This query has been shown to be highly intractable, being PP^PP-complete, and is exemplary of a class of queries which correspond to the computation of certain expectations. We propose the first exact algorithm for computing the SDP, and demonstrate its effectiveness on several real and synthetic networks. Finally, we present new complexity results, such as the complexity of computing the SDP on models with a Naive Bayes structure. Additionally, we prove that computing the non-myopic value of information is complete for the same complexity class as computing the SDP.
Communication Communities in MOOCs
Gillani, Nabeel, Eynon, Rebecca, Osborne, Michael, Hjorth, Isis, Roberts, Stephen
Massive Open Online Courses (MOOCs) bring together thousands of people from different geographies and demographic backgrounds -- but to date, little is known about how they learn or communicate. We introduce a new content-analysed MOOC dataset and use Bayesian Non-negative Matrix Factorization (BNMF) to extract communities of learners based on the nature of their online forum posts. We see that BNMF yields a superior probabilistic generative model for online discussions when compared to other models, and that the communities it learns are differentiated by their composite students' demographic and course performance indicators. These findings suggest that computationally efficient probabilistic generative modelling of MOOCs can reveal important insights for educational researchers and practitioners and help to develop more intelligent and responsive online learning environments.
Hybrid Conditional Gradient - Smoothing Algorithms with Applications to Sparse and Low Rank Regularization
Argyriou, Andreas, Signoretto, Marco, Suykens, Johan
Conditional gradient methods are old and well studied optimization algorithms. Their origin dates at least to the 50's and the Frank-Wolfe algorithm for quadratic programming [18] but they apply to much more general optimization problems. General formulations of conditional gradient algorithms have been studied in the past and various convergence properties of these algorithms have been proven. Moreover, such algorithms have found application in many fields, such as optimal control, statistics, signal processing, computational geometry and machine learning. Currently, interest in conditional gradient methods is undergoing a revival because of their computational advantages when applied to certain large scale optimization problems. Such examples are regularization problems involving sparsity or low rank constraints, which appear in many widely used methods in machine learning. Inspired by such algorithms, in this chapter we study a first-order method for solving certain convex optimization problems. We focus on problems of the form min {f(x) g(Ax) ω(x): x H}. 1 over a real Hilbert space H. We assume that f is a convex function with Hölder continuous gradient, g a Lipschitz continuous convex function, A a bounded linear operator and ω a convex function defined over a bounded domain.