Goto

Collaborating Authors

 Agents


"Conservatives Overfit, Liberals Underfit": The Social-Psychological Control of Affect and Uncertainty

arXiv.org Artificial Intelligence

The presence of artificial agents in human social networks is growing. From chatbots to robots, human experience in the developed world is moving towards a socio-technical system in which agents can be technological or biological, with increasingly blurred distinctions between. Given that emotion is a key element of human interaction, enabling artificial agents with the ability to reason about affect is a key stepping stone towards a future in which technological agents and humans can work together. This paper presents work on building intelligent computational agents that integrate both emotion and cognition. These agents are grounded in the well-established social-psychological Bayesian Affect Control Theory (BayesAct). The core idea of BayesAct is that humans are motivated in their social interactions by affective alignment: they strive for their social experiences to be coherent at a deep, emotional level with their sense of identity and general world views as constructed through culturally shared symbols. This affective alignment creates cohesive bonds between group members, and is instrumental for collaborations to solidify as relational group commitments. BayesAct agents are motivated in their social interactions by a combination of affective alignment and decision theoretic reasoning, trading the two off as a function of the uncertainty or unpredictability of the situation. This paper provides a high-level view of dual process theories and advances BayesAct as a plausible, computationally tractable model based in social-psychological theory. We introduce a revised BayesAct model that more deeply integrates social-psychological theorising, and we demonstrate a component of the model as being sufficient to account for cognitive biases about fairness, dissonance and conformity. We show how the model can unify different exploration strategies in reinforcement learning.


PHYRE: A New Benchmark for Physical Reasoning

arXiv.org Artificial Intelligence

Understanding and reasoning about physics is an important ability of intelligent agents. We develop the PHYRE benchmark for physical reasoning that contains a set of simple classical mechanics puzzles in a 2D physical environment. The benchmark is designed to encourage the development of learning algorithms that are sample-efficient and generalize well across puzzles. We test several modern learning algorithms on PHYRE and find that these algorithms fall short in solving the puzzles efficiently. We expect that PHYRE will encourage the development of novel sample-efficient agents that learn efficient but useful models of physics. For code and to play PHYRE for yourself, please visit https://player.phyre.ai.


Mastering emergent language: learning to guide in simulated navigation

arXiv.org Artificial Intelligence

To cooperate with humans effectively, virtual agents need to be able to understand and execute language instructions. A typical setup to achieve this is with a scripted teacher which guides a virtual agent using language instructions. However, such setup has clear limitations in scalability and, more importantly, it is not interactive. Here, we introduce an autonomous agent that uses discrete communication to interactively guide other agents to navigate and act on a simulated environment. The developed communication protocol is trainable, emergent and requires no additional supervision. The emergent language speeds up learning of new agents, it generalizes across incrementally more difficult tasks and, contrary to most other emergent languages, it is highly interpretable. We demonstrate how the emitted messages correlate with particular actions and observations, and how new agents become less dependent on this guidance as training progresses. By exploiting the correlations identified in our analysis, we manage to successfully address the agents in their own language.


Evaluating Empathy in Artificial Agents

arXiv.org Artificial Intelligence

Abstract--The novel research area of computational empathy is in its infancy and moving towards developing methods and standards. One major problem is the lack of agreement on the evaluation of empathy in artificial interactive systems . Even though the existence of well-established methods from psyc hol-ogy, psychiatry and neuroscience, the translation between these methods and computational empathy is not straightforward. It requires a collective effort to develop metrics that are mor e suitable for interactive artificial agents. This paper is ai med as an attempt to initiate the dialogue on this important proble m. We examine the evaluation methods for empathy in humans and provide suggestions for the development of better metri cs to evaluate empathy in artificial agents. We acknowledge the difficulty of arriving at a single solution in a vast variety o f interactive systems and propose a set of systematic approac hes that can be used with a variety of applications and systems. Emerging technologies continue to change the ways in which we interact with computers. Computational systems are evolving from being mere tools to assistants, trainers and companion agents. All of these new roles assigned to these systems highlight the importance of embodying these agents with social and emotional capabilities.


#IJCAI in tweets – tutorials and workshops day 2

Robohub

Here's our daily update in tweets, live from IJCAI (International Joint Conference on Artificial Intelligence) in Macau. Like yesterday, we'll be covering tutorials and workshops. Now attending the #tutorial "Argumentation and Machine Learning: When the Whole is Greater than the Sum of its Parts" by @CeruttiFederico, & learning about #ML mechanisms that create, annotate, analyze & evaluate arguments expressed in natural language.#AI Now: "Dialogues with Socially Aware Robot Agents – Knowledge & Reasoning using Natural Language," an invited #IJCAI2019 talk by Prof. Kristiina Jokinen Her start: "The quality of #intelligence possessed by humans and #AI is fundamentally different."#Bridging2019 On his second slide: #AGI "needs fresh methods with cognitive architectures and philosophy of mind."#AI


Multi-Agent Manipulation via Locomotion using Hierarchical Sim2Real

arXiv.org Artificial Intelligence

Manipulation and locomotion are closely related problems that are often studied in isolation. In this work, we study the problem of coordinating multiple mobile agents to exhibit manipulation behaviors using a reinforcement learning (RL) approach. Our method hinges on the use of hierarchical sim2real -- a simulated environment is used to learn low-level goal-reaching skills, which are then used as the action space for a high-level RL controller, also trained in simulation. The full hierarchical policy is then transferred to the real world in a zero-shot fashion. The application of domain randomization during training enables the learned behaviors to generalize to real-world settings, while the use of hierarchy provides a modular paradigm for learning and transferring increasingly complex behaviors. We evaluate our method on a number of real-world tasks, including coordinated object manipulation in a multi-agent setting. See videos at https://sites.google.com/view/manipulation-via-locomotion


Decision making in dynamic and interactive environments based on cognitive hierarchy theory: Formulation, solution, and application to autonomous driving

arXiv.org Artificial Intelligence

Abstract-- In this paper, we describe a framework for autonomous decision making in a dynamic and interactive environment based on cognitive hierarchy theory. We model the in - teractions between the ego agent and its operating environm ent as a two-player dynamic game, and integrate cognitive behav - ioral models, Bayesian inference, and receding-horizon op timal control to define a dynamically-evolving decision strategy for the ego agent. Simulation examples representing autonomou s vehicle control in three traffic scenarios where the autonom ous ego vehicle interacts with a human-driven vehicle are repor ted. Autonomous systems are becoming more capable, better accepted, and more commonplace. Many autonomous systems, including collaborative robots [1] and self-driv ing cars [2], operate in dynamic and interactive environments.


A Review of Cooperative Multi-Agent Deep Reinforcement Learning

arXiv.org Artificial Intelligence

Deep Reinforcement Learning has made significant progress in multi-agent systems in recent years. In this review article, we have mostly focused on recent papers on Multi-Agent Reinforcement Learning (MARL) than the older papers, unless it was necessary. Several ideas and papers are proposed with different notations, and we tried our best to unify them with a single notation and categorize them by their relevance. In particular, we have focused on five common approaches on modeling and solving multi-agent reinforcement learning problems: (I) independent-learners, (II) fully observable critic, (III) value function decomposition, (IV) consensus, (IV) learn to communicate. Moreover, we discuss some new emerging research areas in MARL along with the relevant recent papers. In addition, some of the recent applications of MARL in real world are discussed. Finally, a list of available environments for MARL research are provided and the paper is concluded with proposals on the possible research directions.


Robotics and Autonomous Systems

#artificialintelligence

You can use algorithms and apps to systematically analyze, design, and visualize the behavior of complex systems in time and frequency domains. Automatically tune compensator parameters using interactive techniques such as bode loop shaping and the root locus method. You can tune gain-scheduled controllers and specify multiple tuning objectives, such as reference tracking, disturbance rejection, and stability margins. Code generation and requirements traceability helps you validate your system and certify compliance.


Large-scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning

arXiv.org Machine Learning

Finding the optimal signal timing strategy is a difficult task for the problem of large-scale traffic signal control (TSC). Multi-Agent Reinforcement Learning (MARL) is a promising method to solve this problem. However, there is still room for improvement in extending to large-scale problems and modeling the behaviors of other agents for each individual agent. In this paper, a new MARL, called Cooperative double Q-learning (Co-DQL), is proposed, which has several prominent features. It uses a highly scalable independent double Q-learning method based on double estimators and the UCB policy, which can eliminate the over-estimation problem existing in traditional independent Q-learning while ensuring exploration. It uses mean field approximation to model the interaction among agents, thereby making agents learn a better cooperative strategy. In order to improve the stability and robustness of the learning process, we introduce a new reward allocation mechanism and a local state sharing method. In addition, we analyze the convergence properties of the proposed algorithm. Co-DQL is applied on TSC and tested on a multi-traffic signal simulator. According to the results obtained on several traffic scenarios, Co- DQL outperforms several state-of-the-art decentralized MARL algorithms. It can effectively shorten the average waiting time of the vehicles in the whole road system.