Europe
Domain-Guided Novelty Detection for Autonomous Exploration
Thompson, David Ray (Caltech Jet Propulsion Laboratory)
Here novelty detection identifies salient image features to guide autonomous robotic exploration. There is little advance knowledge of the features in the scene or the proportion that should count as outliers. A new algorithm addresses this ambiguity by modeling novel data in advance and characterizing regular data at run time. Detection thresholds adapt dynamically to reduce misclassification risk while accommodating homogeneous and heterogeneous scenes. Experiments demonstrate the technique on a representative set of navigation images from the Mars Exploration Rover "Opportunity." An efficient image analysis procedure filters each image using the integral transform. Pixel-level features are aggregated into covariance descriptors that represent larger regions. Finally, a distance metric derived from generalized eigenvalues permits novelty detection with kernel density estimation. Results suggest that exploiting training examples of novel data can improve performance in this domain.
Learning Kinematic Models for Articulated Objects
Sturm, Jürgen (University of Freiburg) | Pradeep, Vijay (Willow Garage) | Stachniss, Cyrill (University of Freiburg) | Plagemann, Christian (Stanford University) | Konolige, Kurt (Willow Garage) | Burgard, Wolfram (University of Freiburg)
Robots operating in home environments must be able to interact with articulated objects such as doors or drawers. Ideally, robots are able to autonomously infer articulation models by observation. In this paper, we present an approach to learn kinematic models by inferring the connectivity of rigid parts and the articulation models for the corresponding links. Our method uses a mixture of parameterized and parameter-free (Gaussian process) representations and finds low-dimensional manifolds that provide the best explanation of the given observations. Our approach has been implemented and evaluated using real data obtained in various realistic home environment settings.
Information-Lookahead Planning for AUV Mapping
Saigol, Zeyn A. (University of Birmingham) | Dearden, Richard W. (University of Birmingham) | Wyatt, Jeremy L. (University of Birmingham) | Murton, Bramley J. (National Oceanography Centre, Southampton)
Exploration for robotic mapping is typically handled using greedy entropy reduction. Here we show how to apply information lookahead planning to a challenging instance of this problem in which an Autonomous Underwater Vehicle (AUV) maps hydrothermal vents. Given a simulation of vent behaviour we derive an observation function to turn the planning for mapping problem into a POMDP. We test a variety of information state MDP algorithms against greedy, systematic and reactive search strategies. We show that directly rewarding the AUV for visiting vents induces effective mapping strategies. We evaluate the algorithms in simulation and show that our information lookahead method outperforms the others.
Evaluating Description and Reference Strategies in a Cooperative Human-Robot Dialogue System
Foster, Mary Ellen (University of Edinburgh) | Giuliani, Manuel (Technical University of Munich) | Isard, Amy (University of Edinburgh) | Matheson, Colin (University of Edinburgh) | Oberlander, Jon (University of Edinburgh) | Knoll, Alois (Technical University of Munich)
We then describe In this paper, we describe a user evaluation of a humanrobot a study which assessed the responses of naïve users dialogue system that is designed to enable a humanoid to output that varied along two dimensions: the robot to cooperate with a human partner on building wooden method of describing an assembly plan (pre-order construction toys. In the evaluation, we experimentally vary or post-order), and the method of referring to objects two aspects of the output generated by the system: the way in the world (basic and full). Varying both that it describes assembly plans to the user, and the way that of these factors produced significant results: subjects it refers to objects in the world. We then measure the impact using the system that employed a pre-order of varying each of these features on the users' objective success description strategy asked for instructions to be repeated at working with the system, as well as on their subjective significantly less often than those who experienced impressions of the interaction.
Bayesian Real-time Dynamic Programming
Sanner, Scott (National ICT Australia) | Goetschalckx, Robby (Catholic University of Leuven) | Driessens, Kurt (Catholic University of Leuven) | Shani, Guy (Microsoft Research)
Real-time dynamic programming (RTDP) solves Markov decision processes (MDPs) when the initial state is restricted, by focusing dynamic programming on the envelope of states reachable from an initial state set. RTDP often provides performance guarantees without visiting the entire state space. Building on RTDP, recent work has sought to improve its efficiency through various optimizations, including maintaining upper and lower bounds to both govern trial termination and prioritize state exploration. In this work, we take a Bayesian perspective on these upper and lower bounds and use a value of perfect information (VPI) analysis to govern trial termination and exploration in a novel algorithm we call VPI-RTDP. VPI-RTDP leads to an improvement over state-of-the-art RTDP methods, empirically yielding up to a three-fold reduction in the amount of time and number of visited states required to achieve comparable policy performance.
Plan Recognition as Planning
Ramírez, Miquel (Universitat Pompeu Fabra) | Geffner, Hector (ICREA and Universitat Pompeu Fabra)
In this work we aim to narrow the gap between plan recognition and planning by exploiting the power and generality of recent planning algorithms for recognizing the set G ∗ of goals G that explain a sequence of observations given a domain theory. After providing a crisp definition of this set, we show by means of a suitable problem transformation that a goal G belongs to G ∗ if there is an action sequence π that is an optimal plan for both the goal G and the goal G extended with extra goals representing the observations. Exploiting this result, we show how the set G ∗ can be computed exactly and approximately by minor modifications of existing optimal and suboptimal planning algorithms, and existing polynomial heuristics. Experiments over several domains show that the suboptimal planning algorithms and the polynomial heuristics provide good approximations of the optimal goal set G ∗ while scaling up as well as state-of-the-art planning algorithms and heuristics.
Monte-Carlo Exploration for Deterministic Planning
Nakhost, Hootan (University of Alberta) | Müller, Martin (University of Alberta)
Search methods based on Monte-Carlo simulation have recently led to breakthrough performance improvements in difficult game-playing domains such as Go and General Game Playing. Monte-Carlo Random Walk (MRW) planning applies Monte-Carlo ideas to deterministic classical planning. In the forward chaining planner Arvand, Monte-Carlo random walks are used to explore the local neighborhood of a search state for action selection. In contrast to the stochastic local search approach used in the recent planner Identidem, random walks yield a larger and unbiased sample of the search neighborhood, and require state evaluations only at the endpoints of each walk. On IPC-4 competition problems, the performance of Arvand is competitive with state of the art systems.
Learning Probabilistic Hierarchical Task Networks to Capture User Preferences
Li, Nan (Arizona State University) | Kambhampati, Subbarao (Arizona State University) | Yoon, Sungwook (Arizona State University)
While much work on learning in planning focused on learning domain physics (i.e., action models), and search control knowledge, little attention has been paid towards learning user preferences on desirable plans. Hierarchical task networks (HTN) are known to provide an effective way to encode user prescriptions about what constitute good plans. However, manual construction of these methods is complex and error prone. In this paper, we propose a novel approach to learning probabilistic hierarchical task networks that capture user preferences by examining user-produced plans given no prior information about the methods (in contrast, most prior work on learning within the HTN framework focused on learning “method preconditions”—i.e., domain physics—assuming that the structure of the methods is given as input). We will show that this problem has close parallels to the problem of probabilistic grammar induction, and describe how grammar induction methods can be adapted to learn task networks. We will empirically demonstrate the effectiveness of our approach by showing that task networks we learn are able to generate plans with a distribution close to the distribution of the userpreferred plans.
Trees of Shortest Paths Versus Steiner Trees: Understanding and Improving Delete Relaxation Heuristics
Keyder, Emil Ragip (Universitat Pompeu Fabra) | Geffner, Hector (ICREA &)
Heuristic search using heuristics extracted from the delete relaxation is one of the most effective methods in planning. Since finding the optimal solution of the delete relaxation is intractable, various heuristics introduce independence assumptions, the implications of which are not yet fully understood. Here we use concepts from graph theory to show that in problems with unary action preconditions, the delete relaxation is closely related to the Steiner Tree problem, and that the independence assumption for the set of goals results in a tree-of-shortest-paths approximation. We analyze the limitations of this approximation and develop an alternative method for computing relaxed plans that addresses them. The method is used to guide a greedy best-first search, where it is shown to improve plan quality and coverage over several benchmark domains.
Delaying Commitment in Plan Recognition Using Combinatory Categorial Grammars
Geib, Christopher (University of Edinburgh)
This paper presents a new algorithm for plan recognition called ELEXIR (Engine for LEXicalized Intent Recognition). ELEXIR represents the plans to be recognized with a grammatical formalism called Combinatory Categorial Grammar(CCG). We show that representing plans with CCGs can allow us to prevent early commitment to plan goals and thereby reduce runtime.