Learning Graphical Models
Exploring Child-Robot Tutoring Interactions with Bayesian Knowledge Tracing
Spaulding, Samuel (Massachusetts Institute of Technology) | Breazeal, Cynthia (Massachusetts Institute of Technology)
Computer Science researchers have long sought ways to apply the fruits of their labors to education. From the Logo turtles to the latest Cognitive Tutors, the allure of computers that will understand and help humans learn and grow has been a constant thread in Artificial Intelligence research. Now, advances in robotics and our understanding of Human-Robot Interaction make it feasible to develop physically-present robots that are capable of presenting educational material in an engaging manner, adapting online to sensory information from individual students, and building sophisticated, personalized models of a student’s mastery over complex educational domains. In this paper, we discuss how using physical robots as platforms for artificially intelligent tutors enables an expanded space of possible educational interactions. We also describe a work-in-progress to (1) extend previous work in personalized user models for robotic tutoring and (2) further explore the differences between interaction with physical robots and onscreen agents. Specifically, we are examining how embedding an tutoring interaction inside a story, game, or activity with an agent may differentially affect learning gains and engagement in interactions with physical robots and screen-based agents.
Learning Human Types from Demonstration
Nikolaidis, Stefanos (Massachusetts Institute of Technology) | Gu, Keren (Massachusetts Institute of Technology) | Ramakrishnan, Ramya (Massachusetts Institute of Technology) | Shah, Julie (Massachusetts Institute of Technology)
Research on POMDP formulations for collaborative tasks in game AI applications (Nguyen et al. 2011; Macindoe, The development of new industrial robotic systems that operate Kaelbling, and Lozano-Pérez 2012; Silver and Veness in the same physical space as people highlights the 2010) also assumed a known human model. Additionally, emerging need for robots that can integrate seamlessly into previous partially observable formalisms (Ong et al. 2010; human group dynamics by adapting to the personalized style Bandyopadhyay et al. 2013; Broz, Nourbakhsh, and Simmons of human teammates. This adaptation requires learning a statistical 2011; Fern and Tadepalli 2010; Nguyen et al. 2011; model of human behavior and integrating this model Macindoe, Kaelbling, and Lozano-Pérez 2012) in assistive into the decision-making algorithm of the robot in a principled or collaborative tasks represented the preference or intention way. We present a framework for automatically learning of the human for their own actions, rather than those of human user models from joint-action demonstrations the robot, as the partially observable variable.
Building Blocks of Social Intelligence: Enabling Autonomy for Socially Intelligent and Assistive Robots
Mead, Ross Alan (University of Southern California) | Atrash, Amin (University of Southern California) | Kaszubski, Edward (University of Southern California) | Clair, Aaron St. (University of Southern California) | Greczek, Jillian (University of Southern California) | Clabaugh, Caitlyn (University of Southern California) | Kohan, Brian (University of Southern California) | Mataric, Maja J. (University of Southern California)
Vocalics is the study of the nonverbal aspects of speech, such as volume, pitch, and rate. Our contribution is a parametric We present an overview of the control, recognition, decision-making, vocalic behavior controller that autonomously adjusts and learning techniques utilized by the Interaction the robot speaker volume based on models of how a Lab (robotics.usc.edu/interaction) at the University human user will hear speech produced by the robot. These of Southern California (USC) to enable autonomy in sociable models vary with distance, orientation, and perceived environmental and socially assistive robots. These techniques are implemented interference (Mead & Matarić 2014). Our future with two software libraries: 1) the Social Behavior work will investigate adapting the pitch and rate of speech Library (SBL) provides autonomous social behavior produced by a robot to improve user speech perception.
Modeling Human-Robot Interactions as Systems of Distributed Cognition
Huang, Chien-Ming (University of Wisconsin-Madison) | Mutlu, Bilge (University of Wisconsin-Madison)
Robots that are integrated into day-to-day settings as assistants, collaborators, and companions will engage in dynamic, physically-situated social interactions with their users. Enabling such interactions will require appropriate models and representations for interaction. In this paper, we argue that the dynamic, physically-situated interactions between humans and robots can be characterized as a system of distributed cognition, that this system can be represented using probabilistic graphical models (PGMs), and that the parameters of these models can be learned from human interactions. We illustrate the application of this perspective in our ongoing research on modeling dyadic referential communication.
Intention-Aware Multi-Human Tracking for Human-Robot Interaction via Particle Filtering over Sets
Bai, Aijun (University of Science and Technology of China) | Simmons, Reid (Carnegie Mellon University) | Veloso, Manuela (Carnegie Mellon University) | Chen, Xiaoping (University of Science and Technology of China)
In order to successfully interact with multiple humans in social situations, an intelligent robot should have the ability to track multi-humans, and understand their motion intentions. We formalize this problem as a hidden Markov model, and estimate the posterior densities by particle filtering over sets approach. Our approach avoids directly performing observation-to-target association by defining a set as a joint state. The human identification problem is then solved in an expectation-maximization way. We evaluate the effectiveness of our approach by both benchamark test and real robot experiments.
Humanoid Robots and Spoken Dialog Systems for Brief Health Interventions
Abeyruwan, Saminda (University of Miami) | Baral, Ramesh (Florida International University) | Yasavur, Ugan (Florida International University) | Lisetti, Christine (Florida International University) | Visser, Ubbo (University of Miami)
We combined a spoken dialog system that we developed to deliver brief health interventions with the fully autonomous humanoid robot (NAO). The dialog system is based on a framework facilitating Markov decision processes (MDP). It is optimized using reinforcement learning (RL) algorithms with data we collected from real user interactions. The system begins to learn optimal dialog strategies for initiative selection and for the type of confirmations that it uses during theinteraction. The health intervention, delivered by a 3D character instead of the NAO, has already been evaluated, with positive results in terms of task completion, ease of use, and future intention to use the system. The current spoken dialog system for the humanoid robot is a novelty and exists so far as a proof ofconcept.
A Markov Decision Process Framework for Predictable Job Completion Times on Crowdsourcing Platforms
Lakshminarayanan, Chandrashekar (Indian Institute of Science) | Dubey, Ayush (Indian Institute of Science) | Bhatnagar, Shalabh (Indian Institute of Science) | Balamurugan, Chithralekha (Xerox Research Centre India)
Task starvation leads to huge variation in the completion times of the tasks posted on to the crowd. The price offered to a given task together with the dynamics of the crowd at the time of posting affect its completion time. Large organizations/requesters who frequent the crowd at regular intervals in order to get their tasks done desire predictability in completion times of the tasks. Thus, such requesters have to take into account the crowd dynamics at the time of posting the tasks and price them accordingly. In this work, we study an instance of the pricing problem and propose a solution based on the framework of Markov Decision Processes (MDPs).
Predicting Next Label Quality: A Time-Series Model of Crowdwork
Jung, Hyun Joon (University of Texas at Austin) | Park, Yubin (University of Texas at Austin) | Lease, Matthew (University of Texas at Austin)
While temporal behavioral patterns can be discerned to underlie real crowd work, prior studies have typically modeled worker performance under a simplified i.i.d. assumption. To better model such temporal worker behavior, we propose a time-series label prediction model for crowd work. This latent variable model captures and summarizes past worker behavior, enabling us to better predict the quality of each worker's next label. Given inherent uncertainty in prediction, we also investigate a decision reject option to balance the tradeoff between prediction accuracy vs. coverage. Results show our model improves accuracy of both label prediction on real crowd worker data, as well as data quality overall.
Parallel Task Routing for Crowdsourcing
Bragg, Jonathan (University of Washington) | Kolobov, Andrey (Microsoft Research) | Mausam, Mausam (Indian Institute of Technology, Delhi) | Weld, Daniel S. (University of Washington)
An ideal crowdsourcing or citizen-science system would route tasks to the most appropriate workers, but the best assignment is unclear because workers have varying skill, tasks have varying difficulty, and assigning several workers to a single task may significantly improve output quality. This paper defines a space of task routing problems, proves that even the simplest is NP-hard, and develops several approximation algorithms for parallel routing problems. We show that an intuitive class of requesters' utility functions is submodular, which lets us provide iterative methods for dynamically allocating batches of tasks that make near-optimal use of available workers in each round. Experiments with live oDesk workers show that our task routing algorithm uses only 48% of the human labor compared to the commonly used round-robin strategy. Further, we provide versions of our task routing algorithm which enable it to scale to large numbers of workers and questions and to handle workers with variable response times while still providing significant benefit over common baselines.
Altitude Training: Strong Bounds for Single-Layer Dropout
Wager, Stefan, Fithian, William, Wang, Sida, Liang, Percy
Dropout training, originally designed for deep neural networks, has been successful on high-dimensional single-layer natural language tasks. This paper proposes a theoretical explanation for this phenomenon: we show that, under a generative Poisson topic model with long documents, dropout training improves the exponent in the generalization bound for empirical risk minimization. Dropout achieves this gain much like a marathon runner who practices at altitude: once a classifier learns to perform reasonably well on training examples that have been artificially corrupted by dropout, it will do very well on the uncorrupted test set. We also show that, under similar conditions, dropout preserves the Bayes decision boundary and should therefore induce minimal bias in high dimensions.