AITopics | Agents

Collaborating Authors

Agents

News Overviews Instructional Materials AI-Alerts Classics

Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement Learning

Kandhasamy, Sivanathan, Kuppusamy, Vinayagam Babu, Samak, Tanmay Vilas, Samak, Chinmay Vilas

arXiv.org Artificial IntelligenceNov-11-2020

This work presents a decentralized motion planning framework for addressing the task of multi-robot navigation using deep reinforcement learning. A custom simulator was developed in order to experimentally investigate the navigation problem of 4 cooperative non-holonomic robots sharing limited state information with each other in 3 different settings. The notion of decentralized motion planning with common and shared policy learning was adopted, which allowed robust training and testing of this approach in a stochastic environment since the agents were mutually independent and exhibited asynchronous motion behavior. The task was further aggravated by providing the agents with a sparse observation space and requiring them to generate continuous action commands so as to efficiently, yet safely navigate to their respective goal locations, while avoiding collisions with other dynamic peers and static obstacles at all times. The experimental results are reported in terms of quantitative measures and qualitative remarks for both training and deployment phases.

agent, motion planning, robot, (14 more...)

arXiv.org Artificial Intelligence

2011.05605

Country: Asia > India > Tamil Nadu (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.93)

Add feedback

Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences

Baker, Bowen

arXiv.org Artificial IntelligenceNov-10-2020

Multi-agent reinforcement learning (MARL) has shown recent success in increasingly complex fixed-team zero-sum environments. However, the real world is not zero-sum nor does it have fixed teams; humans face numerous social dilemmas and must learn when to cooperate and when to compete. To successfully deploy agents into the human world, it may be important that they be able to understand and help in our conflicts. Unfortunately, selfish MARL agents typically fail when faced with social dilemmas. In this work, we show evidence of emergent direct reciprocity, indirect reciprocity and reputation, and team formation when training agents with randomized uncertain social preferences (RUSP), a novel environment augmentation that expands the distribution of environments agents play in. RUSP is generic and scalable; it can be applied to any multi-agent environment without changing the original underlying game dynamics or objectives. In particular, we show that with RUSP these behaviors can emerge and lead to higher social welfare equilibria in both classic abstract social dilemmas like Iterated Prisoner's Dilemma as well in more complex intertemporal environments.

agent, prisoner, social dilemma, (12 more...)

arXiv.org Artificial Intelligence

2011.05373

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Combining Propositional Logic Based Decision Diagrams with Decision Making in Urban Systems

Ling, Jiajing, Chandak, Kushagra, Kumar, Akshat

arXiv.org Artificial IntelligenceNov-10-2020

Solving multiagent problems can be an uphill task due to uncertainty in the environment, partial observability, and scalability of the problem at hand. Especially in an urban setting, there are more challenges since we also need to maintain safety for all users while minimizing congestion of the agents as well as their travel times. To this end, we tackle the problem of multiagent pathfinding under uncertainty and partial observability where the agents are tasked to move from their starting points to ending points while also satisfying some constraints, e.g., low congestion, and model it as a multiagent reinforcement learning problem. We compile the domain constraints using propositional logic and integrate them with the RL algorithms to enable fast simulation for RL.

agent, node, simple path, (15 more...)

arXiv.org Artificial Intelligence

2011.04405

Country:

Asia > Singapore (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.82)

Industry: Transportation (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Add feedback

Two motivational artificial beings are better than one for enhancing learning

#artificialintelligenceNov-9-2020, 21:55:04 GMT

Social rewards such as praise are known to enhance various stages of the learning process. Now, researchers from Japan have found that praise delivered by artificial beings such as robots and virtual graphics-based agents can have effects similar to praise delivered by humans, with important practical applications as social services such as education increasingly move to virtual and online platforms. In a study published this month in PLOS ONE,researchers from the University of Tsukuba have shown that motor task performance in participants was significantly enhanced by praise from either one or two robots or virtual agents. Although praise from robots and virtual agents has been found to enhance human motivation and performance during a task, whether these interactions have similar effects on offline skill consolidation, which is an essential component of the learning process, has not been investigated. Further, the various conditions associated with the delivery of praise by robot and virtual agents have not been thoroughly explored previously. The researchers at the University of Tsukuba aimed to address these questions in the present study.

agent, task performance, virtual agent, (11 more...)

#artificialintelligence

Country: Asia > Japan > Honshū > Kantō > Ibaraki Prefecture > Tsukuba (0.48)

Genre: Research Report > New Finding (0.38)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.87)

Add feedback

KRAFTON, Inc. Announces Global Collaboration With Microsoft Azure

#artificialintelligenceNov-9-2020, 11:30:05 GMT

KRAFTON, Inc. announced it is working with Microsoft Azure to host its portfolio of multiplatform products. The deal will include products directly operated by the company and its subsidiaries, including PUBG Corporation's multiplayer battle royale PLAYERUNKNOWN'S BATTLEGROUNDS (PUBG) on PC and consoles, in addition to PUBG MOBILE. Azure is Microsoft's public cloud computing service empowering game creators to build, run, and grow their games on a global scale. With privacy and data security being a top priority for KRAFTON, the company will be working with Microsoft to ensure personal data protection through Azure. The collaboration will ensure that privacy rights are respected and relevant software will be in full compliance with all applicable laws and regulations.

announce global collaboration, krafton, microsoft azure, (3 more...)

#artificialintelligence

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.40)

Add feedback

Voice Tech Summit Middle East - Online

#artificialintelligenceNov-9-2020, 09:45:19 GMT

Dr Hafiz Farooq Ahmad is an Associate Professor at the College of Computer Sciences and Information Technology (CCSIT), King Faisal University, Al Ahsa, Saudi Arabia. He holds a PhD in Distributed Computing from Tokyo Institute of Technology, Tokyo, Japan. He has research interest in semantics systems, machine learning, health informatics and web application security. He is the pioneer for Semantic Web Application Firewall (SWAF) in cooperation with DTS Inc Japan in 2010. He contributed in agent cites project, a European funded research and development project for agent systems.

japan, online, voice tech summit middle east

#artificialintelligence

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.92)
Asia > Middle East > Saudi Arabia > Eastern Province > Al-Ahsa Governorate > Al-Hofuf (0.30)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.78)
Information Technology > Communications > Web > Semantic Web (0.66)

Add feedback

Neural Bridge Sampling for Evaluating Safety-Critical Autonomous Systems

Sinha, Aman, O'Kelly, Matthew, Tedrake, Russ, Duchi, John

arXiv.org Machine LearningNov-9-2020

Learning-based methodologies increasingly find applications in safety-critical domains like autonomous driving and medical robotics. Due to the rare nature of dangerous events, real-world testing is prohibitively expensive and unscalable. In this work, we employ a probabilistic approach to safety evaluation in simulation, where we are concerned with computing the probability of dangerous events. We develop a novel rare-event simulation method that combines exploration, exploitation, and optimization techniques to find failure modes and estimate their rate of occurrence. We provide rigorous guarantees for the performance of our method in terms of both statistical and computational efficiency. Finally, we demonstrate the efficacy of our approach on a variety of scenarios, illustrating its usefulness as a tool for rapid sensitivity analysis and model comparison that are essential to developing and testing safety-critical autonomous systems.

arxiv preprint arxiv, neural network, upstream oil & gas, (20 more...)

arXiv.org Machine Learning

2008.10581

Country:

North America > United States > California (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
North America > United States > Massachusetts (0.14)
Asia (0.14)

Genre: Research Report (0.50)

Industry:

Aerospace & Defense (0.68)
Information Technology > Robotics & Automation (0.48)
Energy > Oil & Gas > Upstream (0.34)
Transportation > Ground > Road (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.88)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.70)
(2 more...)

Add feedback

Social-STAGE: Spatio-Temporal Multi-Modal Future Trajectory Forecast

Malla, Srikanth, Choi, Chiho, Dariush, Behzad

arXiv.org Artificial IntelligenceNov-9-2020

This paper considers the problem of multi-modal future trajectory forecast with ranking. Here, multi-modality and ranking refer to the multiple plausible path predictions and the confidence in those predictions, respectively. We propose Social-STAGE, Social interaction-aware Spatio-Temporal multi-Attention Graph convolution network with novel Evaluation for multi-modality. Our main contributions include analysis and formulation of multi-modality with ranking using interaction and multi-attention, and introduction of new metrics to evaluate the diversity and associated confidence of multi-modal predictions. We evaluate our approach on existing public datasets ETH and UCY and show that the proposed algorithm outperforms the state of the arts on these datasets.

prediction, probability, trajectory, (15 more...)

arXiv.org Artificial Intelligence

2011.04853

Country: North America > United States > California > Santa Clara County > San Jose (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

Multi-Agent Active Search using Realistic Depth-Aware Noise Model

Ghods, Ramina, Durkin, William J., Schneider, Jeff

arXiv.org Artificial IntelligenceNov-9-2020

The search for objects of interest in an unknown environment by making data-collection decisions (i.e., active search or active sensing) has robotics applications in many fields, including the search and rescue of human survivors following disasters, detecting gas leaks or locating and preventing animal poachers. Existing algorithms often prioritize the location accuracy of objects of interest while other practical issues such as the reliability of object detection as a function of distance and lines of sight remain largely ignored. An additional challenge is that in many active search scenarios, communication infrastructure may be damaged, unreliable, or unestablished, making centralized control of multiple search agents impractical. We present an algorithm called Noise-Aware Thompson Sampling (NATS) that addresses these issues for multiple ground-based robot agents performing active search considering two sources of sensory information from monocular optical imagery and sonar tracking. NATS utilizes communications between robot agents in a decentralized manner that is robust to intermittent loss of communication links. Additionally, it takes into account object detection uncertainty from depth as well as environmental occlusions. Using simulation results, we show that NATS significantly outperforms existing methods such as information-greedy policies or exhaustive search. We demonstrate the real-world viability of NATS using a photo-realistic environment created in the Unreal Engine 4 game development platform with the AirSim plugin.

agent, algorithm, detector, (14 more...)

arXiv.org Artificial Intelligence

2011.04825

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Wisconsin (0.04)
North America > United States > Ohio > Franklin County > Columbus (0.04)

Genre:

Overview (0.68)
Research Report (0.64)

Industry: Leisure & Entertainment > Games > Computer Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning

Liu, Shuijing, Chang, Peixin, Liang, Weihang, Chakraborty, Neeloy, Driggs-Campbell, Katherine

arXiv.org Artificial IntelligenceNov-9-2020

Safe and efficient navigation through human crowds is an essential capability for mobile robots. Previous work on robot crowd navigation assumes that the dynamics of all agents are known and well-defined. In addition, the performance of previous methods deteriorates in partially observable environments and environments with dense crowds. To tackle these problems, we propose decentralized structural-Recurrent Neural Network (DS-RNN), a novel network that reasons about spatial and temporal relationships for robot decision making in crowd navigation. We train our network with model-free deep reinforcement learning without any expert supervision. We demonstrate that our model outperforms previous methods and successfully transfer the policy learned in the simulator to a real-world TurtleBot 2i.

international conference, navigation, robot, (12 more...)

arXiv.org Artificial Intelligence

2011.0482

Country: North America > United States > Illinois (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback