Education
Clustering evolving data using kernel-based methods
In this thesis, we propose several modelling strategies to tackle evolving data in different contexts. In the framework of static clustering, we start by introducing a soft kernel spectral clustering (SKSC) algorithm, which can better deal with overlapping clusters with respect to kernel spectral clustering (KSC) and provides more interpretable outcomes. Afterwards, a whole strategy based upon KSC for community detection of static networks is proposed, where the extraction of a high quality training sub-graph, the choice of the kernel function, the model selection and the applicability to large-scale data are key aspects. This paves the way for the development of a novel clustering algorithm for the analysis of evolving networks called kernel spectral clustering with memory effect (MKSC), where the temporal smoothness between clustering results in successive time steps is incorporated at the level of the primal optimization problem, by properly modifying the KSC formulation. Later on, an application of KSC to fault detection of an industrial machine is presented. Here, a smart pre-processing of the data by means of a proper windowing operation is necessary to catch the ongoing degradation process affecting the machine. In this way, in a genuinely unsupervised manner, it is possible to raise an early warning when necessary, in an online fashion. Finally, we propose a new algorithm called incremental kernel spectral clustering (IKSC) for online learning of non-stationary data. This ambitious challenge is faced by taking advantage of the out-of-sample property of kernel spectral clustering (KSC) to adapt the initial model, in order to tackle merging, splitting or drifting of clusters across time. Real-world applications considered in this thesis include image segmentation, time-series clustering, community detection of static and evolving networks.
A Joint Probabilistic Classification Model of Relevant and Irrelevant Sentences in Mathematical Word Problems
Cetintas, Suleyman, Si, Luo, Xin, Yan Ping, Zhang, Dake, Park, Joo Young, Tzur, Ron
Estimating the difficulty level of math word problems is an important task for many educational applications. Identification of relevant and irrelevant sentences in math word problems is an important step for calculating the difficulty levels of such problems. This paper addresses a novel application of text categorization to identify two types of sentences in mathematical word problems, namely relevant and irrelevant sentences. A novel joint probabilistic classification model is proposed to estimate the joint probability of classification decisions for all sentences of a math word problem by utilizing the correlation among all sentences along with the correlation between the question sentence and other sentences, and sentence text. The proposed model is compared with i) a SVM classifier which makes independent classification decisions for individual sentences by only using the sentence text and ii) a novel SVM classifier that considers the correlation between the question sentence and other sentences along with the sentence text. An extensive set of experiments demonstrates the effectiveness of the joint probabilistic classification model for identifying relevant and irrelevant sentences as well as the novel SVM classifier that utilizes the correlation between the question sentence and other sentences. Furthermore, empirical results and analysis show that i) it is highly beneficial not to remove stopwords and ii) utilizing part of speech tagging does not make a significant improvement although it has been shown to be effective for the related task of math word problem type classification.
Zero-Aliasing Correlation Filters for Object Recognition
Fernandez, Joseph A., Boddeti, Vishnu Naresh, Rodriguez, Andres, Kumar, B. V. K. Vijaya
Correlation filters (CFs) are a class of classifiers that are attractive for object localization and tracking applications. Traditionally, CFs have been designed in the frequency domain using the discrete Fourier transform (DFT), where correlation is efficiently implemented. However, existing CF designs do not account for the fact that the multiplication of two DFTs in the frequency domain corresponds to a circular correlation in the time/spatial domain. Because this was previously unaccounted for, prior CF designs are not truly optimal, as their optimization criteria do not accurately quantify their optimization intention. In this paper, we introduce new zero-aliasing constraints that completely eliminate this aliasing problem by ensuring that the optimization criterion for a given CF corresponds to a linear correlation rather than a circular correlation. This means that previous CF designs can be significantly improved by this reformulation. We demonstrate the benefits of this new CF design approach with several important CFs. We present experimental results on diverse data sets and present solutions to the computational challenges associated with computing these CFs. Code for the CFs described in this paper and their respective zero-aliasing versions is available at http://vishnu.boddeti.net/projects/correlation-filters.html
Deep Exponential Families
Ranganath, Rajesh, Tang, Linpeng, Charlin, Laurent, Blei, David M.
We describe \textit{deep exponential families} (DEFs), a class of latent variable models that are inspired by the hidden structures used in deep neural networks. DEFs capture a hierarchy of dependencies between latent variables, and are easily generalized to many settings through exponential families. We perform inference using recent "black box" variational inference techniques. We then evaluate various DEFs on text and combine multiple DEFs into a model for pairwise recommendation data. In an extensive study, we show that going beyond one layer improves predictions for DEFs. We demonstrate that DEFs find interesting exploratory structure in large data sets, and give better predictive performance than state-of-the-art models.
Distributed Policy Evaluation Under Multiple Behavior Strategies
Macua, Sergio Valcarcel, Chen, Jianshu, Zazo, Santiago, Sayed, Ali H.
We apply diffusion strategies to develop a fully-distributed cooperative reinforcement learning algorithm in which agents in a network communicate only with their immediate neighbors to improve predictions about their environment. The algorithm can also be applied to off-policy learning, meaning that the agents can predict the response to a behavior different from the actual policies they are following. The proposed distributed strategy is efficient, with linear complexity in both computation time and memory footprint. We provide a mean-square-error performance analysis and establish convergence under constant step-size updates, which endow the network with continuous learning capabilities. The results show a clear gain from cooperation: when the individual agents can estimate the solution, cooperation increases stability and reduces bias and variance of the prediction error; but, more importantly, the network is able to approach the optimal solution even when none of the individual agents can (e.g., when the individual behavior policies restrict each agent to sample a small portion of the state space).
Opportunistic Storytelling: An Experience-Oriented Strategy for Playable Interactive Narratives
Tomai, Emmett (University of Texas - Pan American)
AI research in interactive narrative often lacks specificity as to the player experience it is trying to enable. In this paper, we consider a set of desirable elements from narrative and interactive experiences, and show by looking at playable experiences from industry and academia that combining them has the potential to be limited or self-defeating. To address these issues, we propose opportunistic storytelling , a set of design principles for near-term playable interactive narratives.
An Interactive Narrative System for Narrative-Based Games for Health
Yin, Langxuan (Northeastern University) | Bickmore, Timothy (Northeastern University) | Montfort, Nick (Massachusetts Institute of Technology)
This paper presents an interactive narrative framework we have designed for games that promote health behavior change. The framework aims to address two key issues: player engagement with the game, and player adherence to the health behavior change-related homework they receive in the game. In this paper, we describe our narrative system that tackles these issues and a prototype game that promotes physical activity in which our narrative system is integrated.
An HRI Approach to Learning from Demonstration
Akgun, Baris (Georgia Institute of Technology) | Bullard, Kalesha (Georgia Institute of Technology) | Chu, Vivian (Georgia Institute of Technology) | Thomaz, Andrea (Georgia Institute of Technology)
The goal of this research is to enable robots to learn new things from everyday people. For years, the AI and Robotics community has sought to enable robots to efficiently learn new skills from a knowledgeable human trainer, and prior work has focused on several important technical problems. This vast amount of research in the field of robot Learning by Demonstration has by and large only been evaluated with expert humans, typically the system's designer. Thus, neglecting a key point that this interaction takes place within a social structure that can guide and constrain the learning problem. %Moreover, we We believe that addressing this point will be essential for developing systems that can learn from everyday people that are not experts in Machine Learning or Robotics. Our work focuses on new research questions involved in letting robots learn from everyday human partners (e.g., What kind of input do people want to provide a machine learner? How does their mental model of the learning process affect this input? What interfaces and interaction mechanisms can help people provide better input from a machine learning perspective?) Often our research begins with an investigation into the feasibility of a particular machine learning interaction, which leads to a series of research questions around re-designing both the interaction and the algorithm to better suit learning with end-users. We believe this equal focus on both the Machine Learning and the HRI contributions are key to making progress toward the goal of machines learning from humans. In this abstract we briefly overview four different projects that highlight our HRI approach to the problem of Learning from Demonstration.
Sensorimotor Analogies in Learning Abstract Skills and Knowledge: Modeling Analogy-Supported Education in Mathematics and Physics
Besold, Tarek Richard (University of Osnabrück)
In this summary report I give an account of research conducted over the last two years, showing the suitability and the advantages of applying computational analogy-engines in the analysis and design of analogy-based methods and tools in teaching and education. This overview constitutes the conclusion of the first phase of a multi-stage effort trying to introduce computational models of analogy also to education and the learning sciences, thus opening up these fields to computational tools and methods not only on an instrumental level, but also in analytical, conceptual, and design-oriented studies. I locate the "analogy-engines in the classroom" research program within the bigger schemes of studying human creativity and computational creativity, provide an introduction to the theoretical underpinnings of the endeavor, and revisit three worked out case studies serving as proofs of the feasibility of the overall approach.
A Skill Transfer Approach for Continuum Robots — Imitation of Octopus Reaching Motion with the STIFF-FLOP Robot
Malekzadeh, Milad S. (Istituto Italiano di Tecnologia (IIT)) | Calinon, Sylvain (Idiap Research Institute and Istituto Italiano di Tecnologia (IIT)) | Bruno, Danilo (Istituto Italiano di Tecnologia (IIT)) | Caldwell, Darwin G. (Istituto Italiano di Tecnologia (IIT))
The problem of transferring skills to hyper-redundant system requires the design of new motion primitive representations that can cope with multiple sources of noise and redundancy, and that can dynamically handle perturbations in the environment. One way is to take inspiration from invertebrate systems in nature to seek for new versatile representations of motion/behavior primitives for continuum robots. In particular, the incredibly varied skills achieved by the octopus can guide us toward the design of such robust encoding scheme. This abstract presents our ongoing work that aims at combining statistical machine learning, dynamical systems and stochastic optimization to study the problem of transferring skills to a flexible surgical robot (STIFF-FLOP) composed of 2 modules with constant curvatures. The approach is tested in simulation by imitation and self-refinement of an octopus reaching motion.