Agents
Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making
Zhang, Qi (University of Michigan) | Singh, Satinder (University of Michigan) | Durfee, Edmund (University of Michigan)
In cooperative multiagent planning, it can often be beneficial for an agent to make commitments about aspects of its behavior to others, allowing them in turn to plan their own behaviors without taking the agent's detailed behavior into account. Extending previous work in the Bayesian setting, we consider instead a worst-case setting in which the agent has a set of possible environments (MDPs) it could be in, and develop a commitment semantics that allows for probabilistic guarantees on the agent's behavior in any of the environments it could end up facing. Crucially, an agent receives observations (of reward and state transitions) that allow it to potentially eliminate possible environments and thus obtain higher utility by adapting its policy to the history of observations. We develop algorithms and provide theory and some preliminary empirical results showing that they ensure an agent meets its commitments with history-dependent policies while minimizing maximum regret over the possible environments.
Approximately-Optimal Queries for Planning in Reward-Uncertain Markov Decision Processes
Zhang, Shun (University of Michigan) | Durfee, Edmund (University of Michigan) | Singh, Satinder (University of Michigan)
When planning actions to take on behalf of its human operator, a robot might be uncertain about its operator's reward function. We address the problem of how the robot should formulate an (approximately) optimal query to pose to the operator, given how its uncertainty affects which policies it should plan to pursue. We explain how a robot whose queries ask the operator to choose the best from among k choices can, without loss of optimality, restrict consideration to choices only over alternative policies. Further, we present a method for constructing an approximately-optimal policy query that enjoys a performance bound, where the method need not enumerate all policies. Finally, because queries posed to the operator of a robotic system are often expressed in terms of preferences over trajectories rather than policies, we show how our constructed policy query can be projected into the space of trajectory queries. Our empirical results demonstrate that our projection technique can outperform other known techniques for choosing trajectory queries, particularly when the number of trajectories the operator is asked to compare is small.
Using Hierarchical Constraints to Avoid Conflicts in Multi-Agent Pathfinding
Walker, Thayne T. (University of Denver) | Chan, David M. (University of Denver) | Sturtevant, Nathan R. (University of Denver)
Recent work in multi-agent path planning has provided a number of optimal and suboptimal solvers that can efficiently find solutions to problems of growing complexity. Solvers based on Conflict-Based Search (CBS) combine single-agent solvers with shared constraints between agents to find feasible solutions. Suboptimal variants of CBS introduce alternate heuristics to avoid conflicts. In this paper we study the multi-agent planning problem in the context of non-holonomic vehicles planning on a lattice. We propose that in addition to using heuristics to avoid conflicts, we can plan using a hierarchy of movement constraints to efficiently avoid conflicts. We develop a new extension to the CBS algorithm, CBS with constraint layering (CBS+CL), which iteratively applies different movement constraint models during the CBS planning process. Our results show that this approach allows us to solve for about 2.4 times more agents in the same amount of time when compared to regular CBS without using a constraint hierarchy.
The Limits of Strong Privacy Preserving Multi-Agent Planning
Tožička, Jan (Czech Technical University in Prague) | Štolba, Michal (Czech Technical University in Prague) | Komenda, Antonín (Czech Technical University in Prague)
Multi-agent planning using MA-STRIPS-related models is often motivated by the preservation of private information. Such motivation is not only natural for multi-agent systems, but it is one of the main reasons, why multi-agent planning (MAP) problems cannot be solved centrally. In this paper, we analyze privacy-preserving multi-agent planning (PP-MAP) from the perspective of secure multiparty computation (MPC). We discuss the concept of strong privacy and its implications and present two variants of a novel planner, provably strong privacy-preserving in general. As the main contribution, we formulate the limits of strong privacy-preserving planning in the terms of privacy, completeness and efficiency and show that, for a wide class of planning algorithms, all three properties are not achievable at once. Moreover, we provide a restricted variant of strong privacy based on equivalence classes of planning problems and show that an efficient, complete and strong privacy-preserving planner exists for such restriction.
Multi-Agent Ergodic Coverage with Obstacle Avoidance
Salman, Hadi (Carnegie Mellon University) | Ayvali, Elif (Carnegie Mellon University) | Choset, Howie (Carnegie Mellon University)
Autonomous exploration and search have important applications in robotics. One interesting application is cooperative control of mobile robotic/sensor networks to achieve uniform coverage of a domain. Ergodic coverage is one solution for this problem in which control laws for the agents are derived so that the agents uniformly cover a target area while maintaining coordination with each other. Prior approaches have assumed the target regions contain no obstacles. In this work, we tackle the problem of static and dynamic obstacle avoidance while maintaining an ergodic coverage goal. We pursue a vector-field-based obstacle avoidance approach and define control laws for idealized kinematic and dynamic systems that avoid static and dynamic obstacles while maintaining ergodicity. We demonstrate this obstacle avoidance methodology via numerical simulation and show how ergodicity is maintained. Keywords-- Multi-agent planning, centralized robot control, ergodic theory, uniform coverage, obstacle avoidance.
Multiagent Online Planning with Nested Beliefs and Dialogue
Kominis, Filippos (Universitat Pompeu Fabra) | Geffner, Hector (Universitat Pompeu Fabra)
The problem of planning with partial observability in the presence of a single agent has been addressed as a contingent or POMDP problem. Since the task is computationally hard, on-line approaches have also been developed that just compute the action to do next rather than full policies. In this work, we address a similar problem but in a multiagent setting where agents share a common goal and plan with beliefs which are about the world and the possibly nested beliefs of other agents. For this, we extend the belief tracking formulation of Kominis and Geffner to the on-line setting where plans are supposed to work for the true hidden state as revealed by the observations, and develop an alternative translation into classical planning that is used within a plan-execute-observe-and-replan cycle. Planning is done from the perspective of the agents, and there is a single planning agent in each replanning episode that can change across episodes. We present empirical results and show that interesting agent dialogues arise in this setting where agents collaborate by requesting or volunteering information in a goal-directed manner.
Automated Verification of Social Law Robustness in STRIPS
Karpas, Erez (The Technion-Israel Institute of Technology) | Shleyfman, Alexander (The Technion-Israel Institute of Technology) | Tennenholtz, Moshe (The Technion-Israel Institute of Technology)
Agents operating in a multi-agent environment must consider not just their own actions, but also those of the other agents in the system. Artificial social systems are a well known means for coordinating a set of agents, without requiring centralized planning or online negotiation between agents. Artificial social systems enact a social law which restricts the agents from performing some actions under some circumstances. A good social law prevents the agents from interfering with each other, but does not prevent them from achieving their goals. However, designing good social laws, or even checking whether a proposed social law is good, are hard questions. In this paper, we take a first step towards automating these processes, by formulating criteria for good social laws in a multi-agent planning framework. We then describe an automated technique for verifying if a proposed social law meets these criteria, based on a compilation to classical planning.
Planning Time to Think: Metareasoning for On-Line Planning with Durative Actions
Cserna, Bence (University of New Hampshire) | Ruml, Wheeler (University of New Hampshire) | Frank, Jeremy (NASA Ames Research Center)
When minimizing makespan during off-line planning, the fastest action sequence to reach a particular state is, by definition, preferred. When trying to reach a goal quickly in on-line planning, previous work has inherited that assumption: the faster of two paths that both reach the same state is usually considered to dominate the slower one. In this short paper, we point out that, when planning happens concurrently with execution, selecting a slower action can allow additional time for planning, leading to better plans. We present Slo'RTS, a metareasoning planning algorithm that estimates whether the expected improvement in future decision-making from this increased planning time is enough to make up for the increased duration of the selected action. Using simple benchmarks, we show that Slo'RTS can yield shorter time-to-goal than a conventional planner. This generalizes previous work on metareasoning in on-line planning and highlights the inherent uncertainty present in an on-line setting.
China's AI Advances for Drones to Enable 'Swarm Intelligence' Collection
The 119 drones underwent catapult-assisted take-offs and performed aerial formations, the Xinhua News Agency reported on Sunday. The CETC said "swarm intelligence" is regarded as the core of the artificial intelligence of unmanned systems and the future of intelligent unmanned systems. The huge scale of low cost and multi-function UAVs could be used in risky tasks such as emergency communications. CETC engineer Zhao Yanjie said since drones were invented in 1917, intelligent swarms have "changed the rules of the game." In November 2016, the CETC launched 67 drones during the China International Aviation & Aerospace Exhibition in Zhuhai, South China's Guangdong Province, breaking the previous record of 50 drones by the US Navy, CCTV reported.
Modifying Optimal SAT-Based Approach to Multi-Agent Path-Finding Problem to Suboptimal Variants
Surynek, Pavel (National Institute of Advanced Industrial Science and Technology) | Felner, Ariel (Ben-Gurion University of the Negev) | Stern, Roni (Ben-Gurion University of the Negev) | Boyarski, Eli (Bar-Ilan University)
In multi-agent path finding (MAPF) the task is to find non-conflicting paths for multiple agents. Recently, a SAT-based approach was developed to solve this problem and proved beneficial in many cases when compared to other search-based solvers. In this paper, we introduce SAT-based unbounded- and bounded-suboptimal algorithms and compare them to relevant search-based algorithms.