Asia
Exploiting Coordination Locales in Distributed POMDPs via Social Model Shaping
Varakantham, Pradeep (Singapore Management University) | Kwak, Jun-young (University of Southern California) | Taylor, Matthew (University of Southern California) | Marecki, Janusz (IBM T. J Watson Research Center) | Scerri, Paul (Carnegie Mellon University) | Tambe, Milind (University of Southern California)
Distributed POMDPs provide an expressive framework for modelingย multiagent collaboration problems, but NEXP-Complete complexity hindersย their scalability and application in real-world domains. This paperย introduces a subclass of distributed POMDPs, and TREMOR, an algorithm to solve such distributed POMDPs. The primary novelty of TREMOR is that agents plan individually with a single agent POMDP solver and use social model shaping to implicitly coordinate with other agents. Experiments demonstrate that TREMOR can provide solutions orders of magnitude faster than existing algorithmsย while achieving comparable, or even superior, solution quality.
Pervasive Model Adaptation: The Integration of Planning and Information Gathering in Dynamic Production Systems
Liu, Juan (PARC) | Kuhn, Lukas (PARC) | Kleer, Johan de (PARC) | Zhou, Rong (PARC)
Model-based planning often presumes a static system model, while in a practice physical system may evolve or drift over time. This paper proposes the idea of pervasive model adaptation in a production system, where the model is dynamically updated using observation of production output. The core idea is the interplay between model adaptation and production planning. We seek plans which simultaneously serve the goals of achieving high productivity for production, and information gathering for model adaptation. We use a modular printing example to illustrate issues such as formulation of the information criterion and search strategy for informative plans. The idea of pervasive adaptation can be further extended to improve long term productivity in production systems.
Scalable, Parallel Best-First Search for Optimal Sequential Planning
Kishimoto, Akihiro (Tokyo Institute of Technology and JST PRESTO) | Fukunaga, Alex (Tokyo Institute of Technology) | Botea, Adi (NICTA and The Australian National University)
Large-scale, parallel clusters composed of commodity processors areย increasingly available, enabling the use of vast processingย capabilities and distributed RAM to solve hard search problems. ย We investigate parallel algorithms for optimal sequential planning, with an emphasis onย exploiting distributed memory computing clusters. ย In particular, we focus on an approach which distributes and schedulesย work among processors based on a hash function of the search state. ย We use this approach to parallelize the A* algorithm in theย optimal sequential version of the Fast Downward planner. ย The scaling behavior of the algorithm is evaluated experimentally onย clusters using up to 128 processors, a significant increase compared to previous work in parallelizing planners. ย We show that this approach scales well, allowing us to effectivelyย utilize the large amount of distributed memory to optimally solve problems whichย require hundreds of gigabytes of RAM to solve. We also show that this approach scalesย well for a single, shared-memory multicore machine.
Structural-Pattern Databases
Katz, Michael (Technion - Israel Institute of Technology) | Domshlak, Carmel (Technion - Israel Institute of Technology)
Explicit abstraction heuristics, notably pattern-database and merge-and-shrink heuristics, are employed by some state-of-the-art optimal heuristic-search planners. The major limitation of these abstraction heuristics is that the size of the abstract space has to be bounded by a (large) constant. Targeting this issue, Katz and Domshlak (2008b) introduced structural, and in particular fork-decomposition, abstractions, in which the planning task is abstracted by an instance of a tractable fragment of optimal planning. At first view, however, the lunch was not free. Some of the power of the explicit abstraction heuristics comes from pre-computing the heuristic function offline, and then determine h(s) for each evaluated state s by a very fast lookup in a "database." In contrast, fork-decomposition offer a poly-time, yet far from being fast, computation. ย In this contribution, we show that the time-per-node complexity bottleneck of the fork-decomposition heuristics can be successfully overcome. Specifically, we show that an equivalent of the explicit abstractions' notion of "database" exists for the fork-decomposition abstractions as well, and this despite of their exponential-size abstract spaces. Experimentally, we show that heuristic search with such "databased" fork-decomposition heuristics favorably competes with the state-of-the-art of optimal planning.
Landmarks, Critical Paths and Abstractions: What's the Difference Anyway?
Helmert, Malte (Albert-Ludwigs-Universitรคt Freiburg) | Domshlak, Carmel (Technion)
Current heuristic estimators for classical domain-independent planning are usually based on one of four ideas: delete relaxations , critical paths , abstractions , and, most recently, landmarks . Previously, these different ideas for deriving heuristic functions were largely unconnected. We prove that admissible heuristics based on these ideas are in fact very closely related. Exploiting this relationship, we introduce a new admissible heuristic called the landmark cut heuristic , which compares favourably with the state of the art in terms of heuristic accuracy and overall performance.
Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities
Delgado, Karina Valdivia (University of Sao Paulo) | Sanner, Scott (NICTA-ANU) | Barros, Leliane Nunes de (University of Sao Paulo) | Cozman, Fabio Gagliardi (University of Sao Paulo)
When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of transition probabilities. For example, natural uncertainty arises in the transition specification due to elicitation of MDP transition models from an expert or data, or non-stationary transition distributions arising from insufficient state knowledge. In the interest of obtaining the most robust policy under transition uncertainty, the Markov Decision Process with Imprecise Transition Probabilities (MDP-IPs) has been introduced to model such scenarios. Unfortunately, while solutions to the MDP-IP are well-known, they require nonlinear optimization and are extremely time-consuming in practice. To address this deficiency, we propose efficient dynamic programming methods to exploit the structure of factored MDPIPs. Noting that the key computational bottleneck in the solution of MDP-IPs is the need to repeatedly solve nonlinear constrained optimization problems, we show how to target approximation techniques to drastically reduce the computational overhead of the nonlinear solver while producing bounded, approximately optimal solutions. Our results show up to two orders of magnitude speedup in comparison to traditional โflatโ dynamic programming approaches and up to an order of magnitude speedup over the extension of factored MDP approximate value iteration techniques to MDP-IPs.
A Human-Aware Robot Task Planner
Cirillo, Marcello (รrebro University) | Karlsson, Lars (รrebro University) | Saffiotti, Alessandro (รrebro University)
The growing presence of household robots in inhabited environments arises the need for new robot task planning techniques. These techniques should take into consideration not only the actions that the robot can perform or unexpected external events, but also the actions performed by a human sharing the same environment, in order to improve the cohabitation of the two agents, e.g., by avoiding undesired situations for the human. In this paper, we present a human-aware planner able to address this problem. This planner supports alternative hypotheses of the human plan, temporal duration for the actions of both the robot and the human, constraints on the interaction between robot and human, partial goal achievement and, most importantly, the possibility to use observations of human actions in the policy generated for the robot. The planner has been tested as a standalone component and in conjunction with our framework for human-robot interaction in a real environment.
Enhancing the Context-Enhanced Additive Heuristic with Precedence Constraints
Cai, Dunbo (Jilin University) | Hoffmann, Joerg (SAP Research) | Helmert, Malte (Albert-Ludwigs-Universitaet Freiburg)
Recently, Helmert and Geffner proposed the context-enhanced additive heuristic, where fact costs are evaluated relative to context states that arise from achieving first a pivot condition of each operator. As Helmert and Geffner pointed out, the method can be generalized to consider contexts arising from arbitrary precedence constraints over operator conditions instead. Herein, we provide such a generalization. We extend Helmert and Geffner's equations, and discuss a number of design choices that arise. Drawing on previous work on goal orderings, we design a family of methods for automatically generating precedence constraints. We run large-scale experiments, showing that the technique can help significantly, depending on the choice of precedence constraints. We shed some light on this by profiling the behavior of all possible precedence constraints, using a sampling technique.
Suboptimal and Anytime Heuristic Search on Multi-Core Machines
Burns, Ethan (University of New Hampshire) | Lemons, Seth (University of New Hampshire) | Ruml, Wheeler (University of New Hampshire) | Zhou, Rong (Palo Alto Research Center)
In order to scale with modern processors, planning algorithms must become multi-threaded. In this paper, we present parallel shared-memory algorithms for two problems that underlie many planning systems: suboptimal and anytime heuristic search. We extend a recently-proposed approach for parallel optimal search to the suboptimal case, providing two new pruning rules for bounded suboptimal search. We also show how this new approach can be used for parallel anytime search. Using temporal logic, we prove the correctness of our framework, and in an empirical comparison on STRIPS planning, grid pathfinding, and sliding tile puzzle problems using an 8-core machine, we show that it yields faster search performance than previous proposals.
Clustering Based on Pairwise Distances When the Data is of Mixed Dimensions
In the context of clustering, we consider a generative model in a Euclidean ambient space with clusters of different shapes, dimensions, sizes and densities. In an asymptotic setting where the number of points becomes large, we obtain theoretical guaranties for a few emblematic methods based on pairwise distances: a simple algorithm based on the extraction of connected components in a neighborhood graph; the spectral clustering method of Ng, Jordan and Weiss; and hierarchical clustering with single linkage. The methods are shown to enjoy some near-optimal properties in terms of separation between clusters and robustness to outliers. The local scaling method of Zelnik-Manor and Perona is shown to lead to a near-optimal choice for the scale in the first two methods. We also provide a lower bound on the spectral gap to consistently choose the correct number of clusters in the spectral method.