Country
ReTrASE: Integrating Paradigms for Approximate Probabilistic Planning
Kolobov, Andrey (University of Washington, Seattle) | Mausam, (University of Washington, Seattle) | Weld, Daniel S. (University of Washington, Seattle)
Past approaches for solving MDPs have several weaknesses: 1) Decision-theoretic computation over the state space can yield optimal results but scales poorly. 2) Value-function approximation typically requires human-specified basis functions and has not been shown successful on nominal ("discrete") domains such as those in the ICAPS planning competitions. 3) Replanning by applying a classical planner to a determinized domain model can generate approximate policies for very large problems but has trouble handling probabilistic subtlety. This paper presents ReTrASE, a novel MDP solver, which combines decision theory, function approximation and classical planning in a new way. ReTrASE uses classical planning to create basis functions for value-function approximation and applies expected-utility analysis to this compact space. Our algorithm is memory-efficient and fast (due to its compact, approximate representation), returns high-quality solutions (due to the decision-theoretic framework) and does not require additional knowledge from domain engineers (since we apply classical planning to automatically construct the basis functions). Experiments demonstrate that ReTrASE outperforms winners from the past three probabilistic-planning competitions on many hard problems.
Trees of Shortest Paths Versus Steiner Trees: Understanding and Improving Delete Relaxation Heuristics
Keyder, Emil Ragip (Universitat Pompeu Fabra) | Geffner, Hector (ICREA &)
Heuristic search using heuristics extracted from the delete relaxation is one of the most effective methods in planning. Since finding the optimal solution of the delete relaxation is intractable, various heuristics introduce independence assumptions, the implications of which are not yet fully understood. Here we use concepts from graph theory to show that in problems with unary action preconditions, the delete relaxation is closely related to the Steiner Tree problem, and that the independence assumption for the set of goals results in a tree-of-shortest-paths approximation. We analyze the limitations of this approximation and develop an alternative method for computing relaxed plans that addresses them. The method is used to guide a greedy best-first search, where it is shown to improve plan quality and coverage over several benchmark domains.
Cost-Optimal Planning with Landmarks
Karpas, Erez (Technion) | Domshlak, Carmel (Technion)
Recently, Richter et al. [2008] proposed a novel some point in every solution plan. Previous work way of using a set of landmarks as a pseudo-heuristic within has very successfully exploited planning landmarks a satisficing heuristic search. This technique allowed both in satisficing (non-optimal) planning. We propose a reducing the length of the generated plans, as well as improving methodology for deriving admissible heuristic estimates success rate both with respect to the iterative approach for cost-optimal planning from a set of planning of Hoffmann et al., and with respect to other stateof-the-art landmarks. The resulting heuristics fall into a satisficing planners. In particular, the LAMA planner novel class of multi-path dependent heuristics, and by Richter and Westphal utilizing such a landmarks-based we present a simple best-first search procedure exploiting heuristic search was the clear winner of the Sequential Satisficing such heuristics.
Structured Plans and Observation Reduction for Plans with Contexts
Huang, Wei (South China University of Technology) | Wen, Zhonghua (Xiangtan University) | Jiang, Yunfei (Sun Yat-sen University) | Peng, Hong (South China University of Technology)
In many real world planning domains, some observation information is optional and useless to the execution of a plan; on the other hand, information acquisition may require some kind of cost. The problem of observation reduction for strong plans has been addressed in the literature. However, observation reduction for plans with contexts (which are more general and useful than strong plans in robotics) is still a open problem. In this paper, we present an attempt to solve the problem. Our first contribution is the definition of structured plans, which can encode sequential, conditional and iterative behaviors, and is expressive enough for dealing with incomplete observation information and internal states of the agent. A second contribution is an observation reduction algorithm for plans with contexts, which can transform a plan with contexts into a structured plan that only branches on necessary observation information.
Abnormal Activity Recognition based on HDP-HMM Models
Hu, Derek Hao (Hong Kong University of Science and Technology) | Zhang, Xian-Xing (Nanjing University) | Yin, Jie (CSIRO ICT Centre) | Zheng, Vincent Wenchen (Hong Kong University of Science and Technology) | Yang, Qiang (Hong Kong University of Science and Technology)
Detecting abnormal activities from sensor readings is an important research problem in activity recognition. A number of different algorithms have been proposed in the past to tackle this problem. Many of the previous state-based approaches suffer from the problem of failing to decide the appropriate number of states, which are difficult to find through a trial and-error approach, in real-world applications. In this paper, we propose an accurate and flexible framework for abnormal activity recognition from sensor readings that involves less human tuning of model parameters. Our approach first applies a Hierarchical Dirichlet Process Hidden Markov Model (HDP-HMM), which supports an infinite number of states, to automatically find an appropriate number of states. We incorporate a Fisher Kernel into the One-Class Support Vector Machine (OCSVM) model to filter out the activities that are likely to be normal. Finally, we derive an abnormal activity model from the normal activity models to reduce false positive rate in an unsupervised manner. Our main contribution is that our proposed HDP-HMM models can decide the appropriate number of states automatically, and that by incorporating a Fisher Kernel into the OCSVM model, we can combine the advantages from generative model and discriminative model. We demonstrate the effectiveness of our approach by using several real-world datasets to test our algorithm’s performance.
Learning Hierarchical Task Networks for Nondeterministic Planning Domains
Hogg, Chad (Lehigh University) | Kuter, Ugur (University Of Maryland) | Munoz-Avila, Hector (Lehigh University)
This paper describes how to learn Hierarchical Task Networks (HTNs) in nondeterministic planning domains, where actions may have multiple possible outcomes. We discuss several desired properties that guarantee that the resulting HTNs will correctly handle the nondeterminism in the domain. We developed a new learning algorithm, called ND-HTN-Maker, that exploits these properties. We implemented ND-HTN-Maker in the recently-proposed HTN-Maker system, a goal-regression based HTN learning approach. In our theoretical study, we show that ND-HTN-Maker soundly produces HTN planning knowledge in low-order polynomial times, despite the nondeterminism. In our experiments with two nondeterministic planning domains, ND-SHOP2, a well-known HTN planning algorithm for nondeterministic domains, significantly outperformed (in some cases, by about 3 orders of magnitude) the well-known planner MBP using the learned HTNs.
Delaying Commitment in Plan Recognition Using Combinatory Categorial Grammars
Geib, Christopher (University of Edinburgh)
This paper presents a new algorithm for plan recognition called ELEXIR (Engine for LEXicalized Intent Recognition). ELEXIR represents the plans to be recognized with a grammatical formalism called Combinatory Categorial Grammar(CCG). We show that representing plans with CCGs can allow us to prevent early commitment to plan goals and thereby reduce runtime.
Activity Recognition with Intended Actions
Gabaldon, Alfredo (New University of Lisbon)
The following activity recognition problem is considered: a description of the action capabilities of an agent being observed is given. This includes the preconditions and effects of atomic actions and of the activities (sequences of actions) the agent may execute. Given this description and a set of propositions, called history, about action occurrences, intended actions and properties of the world all at various points in time, the problem is to complete the picture as much as possible and determine what has already happened, what the intentions of the agent are, and what may happen as a result of the agent acting on those intentions. We present a framework to solve these activity recognition problems based on a formal language for reasoning about actions that includes a notion of intended actions, and a corresponding formalization in answer set programming.
Optimal Symbolic Planning with Action Costs and Preferences
Edelkamp, Stefan (University of Bremen) | Kissmann, Peter (TU Dortmund)
This paper studies the solving of finite-domain action planning problems with discrete action costs and soft constraints. For sequential optimal planning, a symbolic perimeter database heuristic is addressed in a bucket implementation of A*. For computing net-benefits, we propose symbolic branch-and-bound search together with some search refinements. The net-benefit we optimize is the total benefit of satisfying the goals, minus the total action cost to achieve them. This results in an objective function to be minimized that is a linear expression over the violation of the preferences added to the action cost total.
Domain-Independent, Automatic Partitioning for Probabilistic Planning
Dai, Peng (University of Washington) | Mausam, (University of Washington) | Weld, Daniel S (University of Washington)
Recent progress on external-memory MDP solvers has enabled optimal solutions to large probabilistic planning problems. However, PEMVI requires a human to manually partition the MDP before the planning algorithm can be applied — putting an added burden on the domain designer and detracting from the vision of automated planning. This paper presents a novel partitioning scheme, which automatically subdivides the state space into blocks that respect the memory constraints. Our algorithm first applies static domain analysis to identify candidates for partitioning, and then uses heuristic search to generate a good partition. We evaluate the usefulness of our method in the context of PEMVI across many benchmark domains, showing that it can successfully solve extremely large problems in each domain. We also compare the performance of automatic partitioning with previously reported results using human-designed partitions. Experiments show that our algorithm generates significantly superior partitions, which speed MDP solving and also yield vast memory savings.