Country
Extending the Use of Inference in Temporal Planning as Forwards Search
Coles, Amanda Jane (University of Strathclyde) | Coles, Andrew Ian (University of Strathclyde) | Fox, Maria (University of Strathclyde) | Long, Derek (University of Strathclyde)
PDDL 2.1 supports modelling of complex temporal planning domains in which solutions must exploit concurrency. Few existing temporal planners can solve problems that require concurrency and those that do typically pay a performance price to deploy reasoning machinery that is not always required. In this paper we show how to improve the performance of forward-search planners that attempt to solve the full temporal planning problem, both by narrowing the use of the concurrency machinery to situations that demand it and also by improving the power of inference to prune redundant branches of the search space for common patterns of interaction in temporal domains that do require concurrency. Results illustrate the effectiveness of our ideas in improving the efficiency of a temporal planner that can solve problems with required concurrency, both in domains that exploit this ability and in those that do not.
A Human-Aware Robot Task Planner
Cirillo, Marcello (Örebro University) | Karlsson, Lars (Örebro University) | Saffiotti, Alessandro (Örebro University)
The growing presence of household robots in inhabited environments arises the need for new robot task planning techniques. These techniques should take into consideration not only the actions that the robot can perform or unexpected external events, but also the actions performed by a human sharing the same environment, in order to improve the cohabitation of the two agents, e.g., by avoiding undesired situations for the human. In this paper, we present a human-aware planner able to address this problem. This planner supports alternative hypotheses of the human plan, temporal duration for the actions of both the robot and the human, constraints on the interaction between robot and human, partial goal achievement and, most importantly, the possibility to use observations of human actions in the policy generated for the robot. The planner has been tested as a standalone component and in conjunction with our framework for human-robot interaction in a real environment.
Enhancing the Context-Enhanced Additive Heuristic with Precedence Constraints
Cai, Dunbo (Jilin University) | Hoffmann, Joerg (SAP Research) | Helmert, Malte (Albert-Ludwigs-Universitaet Freiburg)
Recently, Helmert and Geffner proposed the context-enhanced additive heuristic, where fact costs are evaluated relative to context states that arise from achieving first a pivot condition of each operator. As Helmert and Geffner pointed out, the method can be generalized to consider contexts arising from arbitrary precedence constraints over operator conditions instead. Herein, we provide such a generalization. We extend Helmert and Geffner's equations, and discuss a number of design choices that arise. Drawing on previous work on goal orderings, we design a family of methods for automatically generating precedence constraints. We run large-scale experiments, showing that the technique can help significantly, depending on the choice of precedence constraints. We shed some light on this by profiling the behavior of all possible precedence constraints, using a sampling technique.
Suboptimal and Anytime Heuristic Search on Multi-Core Machines
Burns, Ethan (University of New Hampshire) | Lemons, Seth (University of New Hampshire) | Ruml, Wheeler (University of New Hampshire) | Zhou, Rong (Palo Alto Research Center)
In order to scale with modern processors, planning algorithms must become multi-threaded. In this paper, we present parallel shared-memory algorithms for two problems that underlie many planning systems: suboptimal and anytime heuristic search. We extend a recently-proposed approach for parallel optimal search to the suboptimal case, providing two new pruning rules for bounded suboptimal search. We also show how this new approach can be used for parallel anytime search. Using temporal logic, we prove the correctness of our framework, and in an empirical comparison on STRIPS planning, grid pathfinding, and sliding tile puzzle problems using an 8-core machine, we show that it yields faster search performance than previous proposals.
Automatic Derivation of Memoryless Policies and Finite-State Controllers Using Classical Planners
Bonet, Blai (Universidad Simón Bolívar) | Palacios, Héctor (Universidad Simón Bolívar) | Geffner, Héctor (ICREA and Universitat Pompeu Fabra)
Finite-state and memoryless controllers are simple action selection mechanisms widely used in domains such as video-games and mobile robotics. Memoryless controllers stand for functions that map observations into actions, while finite-state controllers generalize memoryless ones with a finite amount of memory. In contrast to the policies obtained from MDPs and POMDPs, finite-state controllers have two advantages: they are often extremely compact, involving a small number of controller states or none at all, and they are general, applying to many problems and not just one. A limitation of finite-state controllers is that they must be written by hand. In this work, we address this limitation, and develop a method for deriving finite-state controllers automatically from models. These models represent a class of contingent problems where actions are deterministic and some fluents are observable. The problem of deriving a controller from such models is converted into a conformant planning problem that is solved using classical planners, taking advantage of a complete translation introduced recently. The controllers derived in this way are 'general' in the sense that they do not solve the original problem only, but many variations as well, including changes in the size of the problem or in the uncertainty of the initial situation and action effects. Experiments illustrating the derivation of such controllers are presented.
Lower Bounding Klondike Solitaire with Monte-Carlo Planning
Bjarnason, Ronald (Oregon State University) | Fern, Alan (Oregon State University) | Tadepalli, Prasad (Oregon State University)
Despite its ubiquitous presence, very little is known about the odds of winning the simple card game of Klondike Solitaire. The main goal of this paper is to investigate the use of probabilistic planning to shed light on this issue. Unfortunatley, most probabilistic planning techniques are not well suited for Klondike due to the difficulties of representing the domain in standard planning languages and the complexity of the required search. Klondike thus serves as an interesting addition to the complement of probabilistic planning domains. In this paper, we study Klondike using several sampling-based planning approaches including UCT, hindsight optimization, and sparse sampling, and establish lower bounds on their performance. We also introduce novel combinations of these approaches and evaluate them in Klondike. We provide a theoretical bound on the sample complexity of a method that naturally combines sparse sampling and UCT. Our results demonstrate that there is a policy that within tight confidence intervals wins over 35% of Klondike games. This result is the first reported lower bound of an optimal Klondike policy.
Continuous Orchestration of Web Services via Planning
Bertoli, Piergiorgio (Fondazione Bruno Kessler) | Kazhamiakin, Raman (Fondazione Bruno Kessler) | Paolucci, Massimo (DoCoMo Euro-Labs) | Pistore, Marco (Fondazione Bruno Kessler) | Raik, Heorhi (Fondazione Bruno Kessler) | Wagner, Matthias (DoCoMo Euro-Labs)
In this paper we realize the synthesis of continuous coordinations By envisaging standards to publish and access services over based on the conceptual framework of (Pistore, the Web, the Service-Oriented Computing (SOC) paradigm Traverso, and Bertoli 2005), which recasts the composition promises a novel degree of interoperability between distributed problem in terms of planning; namely, we act at its core applications that realize business processes. One by adopting a very simple, yet expressive requirements language, cornerstone of SOC stands in the provision of novel and and devising a novel planning algorithm. In particular, more complex business logics by the coordination of existing the requirement language expresses coordination constraints services. Due to the complexity of manually realizing that are transformed into preference-ordered maintenability such coordinations, automatedly supporting the synthesis goals, and the algorithm deals with such goals in of service orchestrations is crucial to the actual enactment the presence of exogenous events (which encode independent of SOC. This problem is extremely hard since, asynchronous evolutions of services).
Improving Planning Performance Using Low-Conflict Relaxed Plans
Baier, Jorge A. (University of Toronto) | Botea, Adi (NICTA and The Australian National University)
The FF relaxed plan heuristic is one of the most effective techniques in domain-independent satisficing planning and is used by many state-of-the-art heuristic-search planners. However, it may sometimes provide quite inaccurate information, since its relaxation strategy, which ignores the delete effects of actions, may oversimplify a problem's structure. In this paper, we propose a novel algorithm for computing relaxed plans which — although still relaxed — aim at respecting much of the structure of the original problem. We accomplish this by generating relaxed plans with a reduced number of conflicts. An action a will add a conflict when added to a relaxed plan if the resulting plan is provably illegal (i.e, not executable) in the un-relaxed problem. As a second contribution, we propose a new lookahead strategy, in the spirit of Vidal's YAHSP lookahead, that can better exploit the contents of relaxed plans. In our experimental analysis, we show that the resulting heuristic improves over the FF heuristic in a number of domains, most notably when lookahead is enabled. Moreover, the resulting system, which uses our new lookahead, is competitive with state-of-the-art planners, and even better in terms of the number of solved problems.
Incremental Policy Generation for Finite-Horizon DEC-POMDPs
Amato, Christopher (University of Massachusetts, Amherst) | Dibangoye, Jilles Steeve (Laval University) | Zilberstein, Shlomo (University of Massachusetts, Amherst)
Solving multiagent planning problems modeled as DEC-POMDPs is an important challenge. These models are often solved by using dynamic programming, but the high resource usage of current approaches results in limited scalability. To improve the efficiency of dynamic programming algorithms, we propose a new backup algorithm that is based on a reachability analysis of the state space. This method, which we call incremental policy generation, can be used to produce an optimal solution for any possible initial state or further scalability can be achieved by making use of a known start state. When incorporated into the optimal dynamic programming algorithm, our experiments show that planning horizon can be increased due to a marked reduction in resource consumption. This approach also fits nicely with approximate dynamic programming algorithms. To demonstrate this, we incorporate it into the state-of-the-art PBIP algorithm and show significant performance gains. The results suggest that the performance of other dynamic programming algorithms for DEC-POMDPs could be similarly improved by integrating the incremental policy generation approach.
A Bernstein-type inequality for stochastic processes of quadratic forms of Gaussian variables
We introduce a Bernstein-type inequality which serves to uniformly control quadratic forms of gaussian variables. The latter can for example be used to derive sharp model selection criteria for linear estimation in linear regression and linear inverse problems via penalization, and we do not exclude that its scope of application can be made even broader.