Industry
The Importance of Selective Knowledge Transfer for Lifelong Learning
Eaton, Eric (Bryn Mawr College) | Lane, Terran (University of New Mexico)
Versatile agents situated in rich, dynamic environments must It is not necessarily possible to select the source knowledge be capable of continually learning and refining their knowledge to transfer to a new target task by examining only the surface through experience. These agents will face a variety of similarities between the tasks. The selection must support learning tasks, and can transfer knowledge between tasks to the process of knowledge transfer by choosing source improve performance and accelerate learning. In this context, knowledge based on whether it will transfer well to the target a learning task can be as simple as discovering the effects task. In our previous work, we developed methods that of an operator on the environment, or as complex as accomplishing identify the source knowledge to transfer based on this concept a specific goal -- anything that can be learned of transferability to the target task. Intuitively, transferability can be considered a task. As the agent experiences and learns is the amount that the transferred information is a model for each task, it gains access to new data and knowledge.
The Common Origins of Language and Action
D' (IIT - Istituto Italiano di Tecnologia) | Ausilio, Alessandro ( IIT - Istituto Italiano di Tecnologia ) | Fadiga, Luciano
In fact, goal-driven hierarchical structure to concatenate simple human behavior is mostly constituted by goal-directed motor acts. This hierarchical goal structure as well as the actions based on the synergic composition of simpler rules, which connect individual motor elements, might be motor constituents chained together according to a precise paralleled to the syntactic organization of language.
Language Models for Semantic Extraction and Filtering in Video Action Recognition
Tzoukermann, Evelyne (The MITRE Corporation) | Neumann, Jan (Comcast) | Kosecka, Jana (George Mason University) | Fermuller, Cornelia (University of Maryland) | Perera, Ian (University of Pennsylvania) | Ferraro, Frank (University of Rochester) | Sapp, Ben (University of Pennsylvania) | Chaudhry, Rizwan (Johns Hopkins University) | Singh, Gautam (George Mason University)
The paper addresses the following issues:ย (a) how to represent semantic information from natural language so that a vision model can utilize it?ย (b) how to extract the salient textual information relevant to vision?ย For a given domain, we present a new model of semantic extraction that takes into account word relatedness as well as word disambiguation in order to apply to a vision model. We automatically process the text transcripts and perform syntactic analysis to extract dependency relations. We then perform semantic extraction on the output to filter semantic entities related to actions. The resulting data are used to populate a matrix of co-occurrences utilized by the vision processing modules.ย Results show that explicitly modeling the co-occurrence of actions and tools significantly improved performance.
FAQ-Learning in Matrix Games: Demonstrating Convergence Near Nash Equilibria, and Bifurcation of Attractors in the Battle of Sexes
Kaisers, Michael (Maastricht University) | Tuyls, Karl (Maastricht University)
This article studies Frequency Adjusted Q-learning (FAQ-learning), a variation of Q-learning that simulates simultaneous value function updates. The main contributions are empirical and theoretical support for the convergence of FAQ-learning to attractors near Nash equilibria in two-agent two-action matrix games.The games can be divided into three types: Matching pennies, Prisoners' Dilemma and Battle of Sexes. This article shows that the Matching pennies and Prisoners' Dilemma yield one attractor of the learning dynamics, while the Battle of Sexes exhibits a supercritical pitchfork bifurcation at a critical temperature, where one attractor splits into two attractors and one repellent fixed point. Experiments illustrate that the distance between fixed points of the FAQ-learning dynamics and Nash equilibria tends to zero as the exploration parameter of FAQ-learning approaches zero.
Efficiently Eliciting Preferences from a Group of Users
Hines, Greg (University of Waterloo) | Larson, Kate (University of Waterloo)
Learning about users' preferences allows agents to make intelligent decisions on behalf of users. When we are eliciting preferences from a group of users, we can use the preferences of the users we have already processed to increase the efficiency of the elicitation process for the remaining users. However, current methods either require strong prior knowledge about the users' preferences or can be overly cautious and inefficient. Our method, based on standard techniques from non-parametric statistics, allows the controller to choose a balance between prior knowledge and efficiency. This balance is investigated through experimental results.
Reciprocal Preference Model for Two Player Dilemma Games
Ahmed, Asrar (IIIT Hyderabad) | Karlapalem, Kamalakar (IIIT Hyderabad)
Results from behavioral economics show that individuals do not always maximize monetary payoffs. Within behavioral economics different models of social preference have been put forth to account for this deviation from standard assumptions of game theory and economics. Incorporating such models into agent decision making is increasingly relevant to design systems which interact with or on behalf of humans. Existing models, which correctly predict outcomes across a large set of games, are fairly complex. To this end, we present aspiration based social preference model and evaluate it by considering two player dilemma games. We show that the qualitative predictions of our model are consistent with results from behavioral economics.
Leading Multiple Ad Hoc Teammates in Joint Action Settings
Agmon, Noa (The University of Texas at Austin) | Stone, Peter (The University of Texas at Austin)
The growing use of autonomous agents in practice may require agents to cooperate as a team in situations where they have limited prior knowledge about one another, cannot communicate directly, or do not share the same world models. These situations raise the need to design ad hoc team members, i.e., agents that will be able to cooperate without coordination in order to reach an optimal team behavior. This paper considers problem of leading N-agent teams by a single agent toward their optimal joint utility, where the agents compute their next actions based only on their most recent observations of their teammates' actions. We show that compared to previous results in two-agent teams, in larger teams the agent might not be able to lead the team to the action with maximal joint utility. In these cases, the agent's optimal strategy leads the team to the best possible reachable cycle of joint actions. We describe a graphical model of the problem and a polynomial time algorithm for solving it. We then consider the problem of leading teams where the agents' base their actions on a longer history of past observations, showing that the an upper bound computation time exponential in the memory size is very likely to be tight.
The Elderly and Robots: From Experiments based on Comparison with Younger People
Nomura, Tatsuya (Ryukoku University) | Takeuchi, Saori (Ryukoku University)
Robot factors such as motions and utterances have a possibility of interaction effects with generation and other human factors, and these effects influence robotics design in elder care. Some psychological experiments conducted in our research group found these interaction effects between generation and other factors based on directly comparison between younger and elder persons in interaction with a small-sized humanoid robot. The paper firstly reviews the previous two studies, reports results of the current experiment, and then discusses about their implications from the perspective of robotics design for elder care.
Ethical Implications of Using the Paro Robot, with a Focus on Dementia Patient Care
Calo, Christopher James (Southern New Hampshire University) | Hunt-Bull, Nicholas (Southern New Hampshire University) | Lewis, Lundy (Southern New Hampshire University) | Metzler, Ted ( Oklahoma City University )
This paper examines the ability of the Paro robot to improve the lives of elderly dementia patients by applying modern technology to medicine. Paro is not intended to be a replacement for social interaction with people or animals. Some patients who know Paro is a robot still enjoy using the robotic seal, and it can calm patients who are otherwise unreachable. Robots like Paro which mimic the behaviors of pets offer excellent opportunities to connect with challenging patients; however they raise concerns regarding patient rights and autonomy. While such concerns are worthy of consideration, which we discuss in this paper, we nonetheless conclude that the benefits of using such a treatment tool outweigh its potential risks.
A General Perceptual Model for Eldercare Robots
Becker, Timothy James (University of Hartford)
A general perceptual model is proposed for Eldercare Robot implementation that is comprised of audition functionality interconnected with a feedback-driven perceptual reasoning agent. Using multistage signal analysis to feed temporally tiered learning/recognition modules, concurrent access to sound event localization, classification, and context is realized. Patterns leading to the quantification of patient emotion/well being can be inferred using a perceptual reasoning agent. The system is prototyped using a Nao H-25 humanoid robot with an online processor running the Nao Qi SDK and the Max/MSP environment with the FTM, and GF libraries.