Agents
Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement Learning
Kandhasamy, Sivanathan, Kuppusamy, Vinayagam Babu, Samak, Tanmay Vilas, Samak, Chinmay Vilas
This work presents a decentralized motion planning framework for addressing the task of multi-robot navigation using deep reinforcement learning. A custom simulator was developed in order to experimentally investigate the navigation problem of 4 cooperative non-holonomic robots sharing limited state information with each other in 3 different settings. The notion of decentralized motion planning with common and shared policy learning was adopted, which allowed robust training and testing of this approach in a stochastic environment since the agents were mutually independent and exhibited asynchronous motion behavior. The task was further aggravated by providing the agents with a sparse observation space and requiring them to generate continuous action commands so as to efficiently, yet safely navigate to their respective goal locations, while avoiding collisions with other dynamic peers and static obstacles at all times. The experimental results are reported in terms of quantitative measures and qualitative remarks for both training and deployment phases.
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
Multi-agent reinforcement learning (MARL) has shown recent success in increasingly complex fixed-team zero-sum environments. However, the real world is not zero-sum nor does it have fixed teams; humans face numerous social dilemmas and must learn when to cooperate and when to compete. To successfully deploy agents into the human world, it may be important that they be able to understand and help in our conflicts. Unfortunately, selfish MARL agents typically fail when faced with social dilemmas. In this work, we show evidence of emergent direct reciprocity, indirect reciprocity and reputation, and team formation when training agents with randomized uncertain social preferences (RUSP), a novel environment augmentation that expands the distribution of environments agents play in. RUSP is generic and scalable; it can be applied to any multi-agent environment without changing the original underlying game dynamics or objectives. In particular, we show that with RUSP these behaviors can emerge and lead to higher social welfare equilibria in both classic abstract social dilemmas like Iterated Prisoner's Dilemma as well in more complex intertemporal environments.
Combining Propositional Logic Based Decision Diagrams with Decision Making in Urban Systems
Ling, Jiajing, Chandak, Kushagra, Kumar, Akshat
Solving multiagent problems can be an uphill task due to uncertainty in the environment, partial observability, and scalability of the problem at hand. Especially in an urban setting, there are more challenges since we also need to maintain safety for all users while minimizing congestion of the agents as well as their travel times. To this end, we tackle the problem of multiagent pathfinding under uncertainty and partial observability where the agents are tasked to move from their starting points to ending points while also satisfying some constraints, e.g., low congestion, and model it as a multiagent reinforcement learning problem. We compile the domain constraints using propositional logic and integrate them with the RL algorithms to enable fast simulation for RL.
Two motivational artificial beings are better than one for enhancing learning
Social rewards such as praise are known to enhance various stages of the learning process. Now, researchers from Japan have found that praise delivered by artificial beings such as robots and virtual graphics-based agents can have effects similar to praise delivered by humans, with important practical applications as social services such as education increasingly move to virtual and online platforms. In a study published this month in PLOS ONE,researchers from the University of Tsukuba have shown that motor task performance in participants was significantly enhanced by praise from either one or two robots or virtual agents. Although praise from robots and virtual agents has been found to enhance human motivation and performance during a task, whether these interactions have similar effects on offline skill consolidation, which is an essential component of the learning process, has not been investigated. Further, the various conditions associated with the delivery of praise by robot and virtual agents have not been thoroughly explored previously. The researchers at the University of Tsukuba aimed to address these questions in the present study.
KRAFTON, Inc. Announces Global Collaboration With Microsoft Azure
KRAFTON, Inc. announced it is working with Microsoft Azure to host its portfolio of multiplatform products. The deal will include products directly operated by the company and its subsidiaries, including PUBG Corporation's multiplayer battle royale PLAYERUNKNOWN'S BATTLEGROUNDS (PUBG) on PC and consoles, in addition to PUBG MOBILE. Azure is Microsoft's public cloud computing service empowering game creators to build, run, and grow their games on a global scale. With privacy and data security being a top priority for KRAFTON, the company will be working with Microsoft to ensure personal data protection through Azure. The collaboration will ensure that privacy rights are respected and relevant software will be in full compliance with all applicable laws and regulations.
Voice Tech Summit Middle East - Online
Dr Hafiz Farooq Ahmad is an Associate Professor at the College of Computer Sciences and Information Technology (CCSIT), King Faisal University, Al Ahsa, Saudi Arabia. He holds a PhD in Distributed Computing from Tokyo Institute of Technology, Tokyo, Japan. He has research interest in semantics systems, machine learning, health informatics and web application security. He is the pioneer for Semantic Web Application Firewall (SWAF) in cooperation with DTS Inc Japan in 2010. He contributed in agent cites project, a European funded research and development project for agent systems.
Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems
Sinha, Aman, O'Kelly, Matthew, Tedrake, Russ, Duchi, John
Learning-based methodologies increasingly find applications in safety-critical domains like autonomous driving and medical robotics. Due to the rare nature of dangerous events, real-world testing is prohibitively expensive and unscalable. In this work, we employ a probabilistic approach to safety evaluation in simulation, where we are concerned with computing the probability of dangerous events. We develop a novel rare-event simulation method that combines exploration, exploitation, and optimization techniques to find failure modes and estimate their rate of occurrence. We provide rigorous guarantees for the performance of our method in terms of both statistical and computational efficiency. Finally, we demonstrate the efficacy of our approach on a variety of scenarios, illustrating its usefulness as a tool for rapid sensitivity analysis and model comparison that are essential to developing and testing safety-critical autonomous systems.
Social-STAGE: Spatio-Temporal Multi-Modal Future Trajectory Forecast
Malla, Srikanth, Choi, Chiho, Dariush, Behzad
This paper considers the problem of multi-modal future trajectory forecast with ranking. Here, multi-modality and ranking refer to the multiple plausible path predictions and the confidence in those predictions, respectively. We propose Social-STAGE, Social interaction-aware Spatio-Temporal multi-Attention Graph convolution network with novel Evaluation for multi-modality. Our main contributions include analysis and formulation of multi-modality with ranking using interaction and multi-attention, and introduction of new metrics to evaluate the diversity and associated confidence of multi-modal predictions. We evaluate our approach on existing public datasets ETH and UCY and show that the proposed algorithm outperforms the state of the arts on these datasets.
Multi-Agent Active Search using Realistic Depth-Aware Noise Model
Ghods, Ramina, Durkin, William J., Schneider, Jeff
The search for objects of interest in an unknown environment by making data-collection decisions (i.e., active search or active sensing) has robotics applications in many fields, including the search and rescue of human survivors following disasters, detecting gas leaks or locating and preventing animal poachers. Existing algorithms often prioritize the location accuracy of objects of interest while other practical issues such as the reliability of object detection as a function of distance and lines of sight remain largely ignored. An additional challenge is that in many active search scenarios, communication infrastructure may be damaged, unreliable, or unestablished, making centralized control of multiple search agents impractical. We present an algorithm called Noise-Aware Thompson Sampling (NATS) that addresses these issues for multiple ground-based robot agents performing active search considering two sources of sensory information from monocular optical imagery and sonar tracking. NATS utilizes communications between robot agents in a decentralized manner that is robust to intermittent loss of communication links. Additionally, it takes into account object detection uncertainty from depth as well as environmental occlusions. Using simulation results, we show that NATS significantly outperforms existing methods such as information-greedy policies or exhaustive search. We demonstrate the real-world viability of NATS using a photo-realistic environment created in the Unreal Engine 4 game development platform with the AirSim plugin.
Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning
Liu, Shuijing, Chang, Peixin, Liang, Weihang, Chakraborty, Neeloy, Driggs-Campbell, Katherine
Safe and efficient navigation through human crowds is an essential capability for mobile robots. Previous work on robot crowd navigation assumes that the dynamics of all agents are known and well-defined. In addition, the performance of previous methods deteriorates in partially observable environments and environments with dense crowds. To tackle these problems, we propose decentralized structural-Recurrent Neural Network (DS-RNN), a novel network that reasons about spatial and temporal relationships for robot decision making in crowd navigation. We train our network with model-free deep reinforcement learning without any expert supervision. We demonstrate that our model outperforms previous methods and successfully transfer the policy learned in the simulator to a real-world TurtleBot 2i.