Goto

Collaborating Authors

 Planning & Scheduling


Planning under risk and uncertainty based on Prospect-theoretic models

arXiv.org Artificial Intelligence

In this work, we develop a novel sampling-based motion planing approach to generate plans in a risky and uncertain environment. To model a variety of risk-sensitivity profiles, we propose an adaption of Cumulative Prospect Theory (CPT) to the setting of path planning. This leads to the definition of a non-rational continuous cost envelope (as well as a continuous uncertainty envelope) associated with an obstacle environment. We use these metrics along with standard costs like path length to formulate path planning problems. Building on RRT*, we then develop a sampling-based motion planner that generates desirable paths from the perspective of a given risk sensitive profile. Since risk sensitivity can greatly vary, we provide a tuning knob to appease a diversity of decision makers (DM), ranging from totally risk-averse to risk-indifferent. Additionally, we adapt a Simultaneous Perturbation Stochastic Approximation (SPSA)-based algorithm to learn the CPT parameters that can best represent a certain DM. Simulations are presented in a 2D environment to evaluate the modeling approach and algorithm's performance.


No AI Is an Island: The Case for Teaming Intelligence

AI Magazine

The purpose of this article is to draw attention to an aspect of intelligence that has not yet received significant attention from the AI community, but that plays a crucial role in a technologyโ€™s effectiveness in the world, namely teaming intelligence. We propose that Al will reach its full potential only if, as part of its intelligence, it also has enough teaming intelligence to work well with people. Although seemingly counterintuitive, the more intelligent the technological system, the greater the need for collaborative skills. This paper will argue why teaming intelligence is important to AI, provide a general structure for AI researchers to use in developing intelligent systems that team well, assess the current state of the art and, in doing so, suggest a path forward for future AI systems. This is not a call to develop a new capability, but rather, an approach to what AI capabilities should be built, and how, so as to imbue intelligent systems with teaming competence.


What you get is what you see: Decomposing Epistemic Planning using Functional STRIPS

arXiv.org Artificial Intelligence

Epistemic planning --- planning with knowledge and belief --- is essential in many multi-agent and human-agent interaction domains. Most state-of-the-art epistemic planners solve this problem by compiling to propositional classical planning, for example, generating all possible knowledge atoms, or compiling epistemic formula to normal forms. However, these methods become computationally infeasible as problems grow. In this paper, we decompose epistemic planning by delegating reasoning about epistemic formula to an external solver. We do this by modelling the problem using \emph{functional STRIPS}, which is more expressive than standard STRIPS and supports the use of external, black-box functions within action models. Exploiting recent work that demonstrates the relationship between what an agent `sees' and what it knows, we allow modellers to provide new implementations of externals functions. These define what agents see in their environment, allowing new epistemic logics to be defined without changing the planner. As a result, it increases the capability and flexibility of the epistemic model itself, and avoids the exponential pre-compilation step. We ran evaluations on well-known epistemic planning benchmarks to compare with an existing state-of-the-art planner, and on new scenarios based on different external functions. The results show that our planner scales significantly better than the state-of-the-art planner against which we compared, and can express problems more succinctly.


Learning to Plan via Neural Exploration-Exploitation Trees

arXiv.org Machine Learning

Planning paths efficiently in a high-dimensional continuous state and action space is a fundamental yet challenging problem in many real-world applications, such as robot manipulation and autonomous driving. Since the general path planning problem is PSPACE-complete (Reif, 1979), one typically resorts to approximate or heuristic algorithms. Sampling-based planning algorithms, such as probabilistic roadmaps (PRM) (Kavraki et al., 1996), rapidlyexploring random trees (RRT) (LaValle, 1998), and their variants (Karaman & Frazzoli, 2011), provide principled approximate solutions to a wide spectrum of high-dimensional path planning tasks. However, these generic algorithms typically employ a uniform proposal distribution for sampling which does not make use of the structures of the problem at hand and thus may require lots of samples to obtain an initial feasible solution path for complicated tasks, e.g., a narrow passage in a map. To improve the sample efficiency, researchers designed algorithms to take problem structures into account, such as the Gaussian sampler (Boor et al., 1999), the bridge test (Hsu et al., 2003), the reachability-guided sampler (Shkolnik et al., 2009), the


Using SMT Solvers to Validate Models for AI Problems

arXiv.org Artificial Intelligence

Artificial Intelligence problems, ranging form planning/scheduling up to game control, include an essential crucial step: describing a model which accurately defines the problem's required data, requirements, allowed transitions and established goals. The ways in which a model can fail are numerous and often lead to a failure of search strategies to provide a quick, optimal, or even any solution. This paper proposes using SMT (Satisfiability Modulo Theories) solvers, such as Z3, to check the validity of a model. We propose two tests: checking whether a final(goal) state exists in the model's described problem space and checking whether the transitions described can provide a path from the identified initial states to any the goal states (meaning a solution has been found). The advantage of using an SMT solver for AI model checking is that they substitute actual search strategies and they work over an abstract representation of the model, that is, a set of logical formulas. Reasoning at an abstract level is not as expensive as exploring the entire solution space. SMT solvers use efficient decision procedures which provide proofs for the logical formulas corresponding to the AI model. A recent addition to Z3 allowed us to describe sequences of transitions as a recursive function, thus we can check if a solution can be found in the defined model.


Why Couldn't You do that? Explaining Unsolvability of Classical Planning Problems in the Presence of Plan Advice

arXiv.org Artificial Intelligence

Explainable planning is widely accepted as a prerequisite for autonomous agents to successfully work with humans. While there has been a lot of research on generating explanations of solutions to planning problems, explaining the absence of solutions remains an open and under-studied problem, even though such situations can be the hardest to understand or debug. In this paper, we show that hierarchical abstractions can be used to efficiently generate reasons for unsolvability of planning problems. In contrast to related work on computing certificates of unsolvability, we show that these methods can generate compact, human-understandable reasons for unsolvability. Empirical analysis and user studies show the validity of our methods as well as their computational efficacy on a number of benchmark planning domains.


AI & Architecture

#artificialintelligence

Our work proposes to evidence this promise when applied to the built environment. Specifically, we offer to apply AI to floor plans analysis and generation. Our methodology follows two main intuitions (1) the creation of building plans is a non-trivial technical challenge, although encompassing standard optimization technics, and (2) the design of space is a sequential process, requiring successive design steps across different scales (urban scale, building scale, unit scale). Then, in order to harness these two realities, we have chosen nested Generative Adversarial Neural Networks or GANs. Such models enable us to capture more complexity across encountered floor plans and to break down the complexity by tackling problems through successive steps.


Incremental Learning of Discrete Planning Domains from Continuous Perceptions

arXiv.org Artificial Intelligence

We propose a framework for learning discrete deterministic planning domains. In this framework, an agent learns the domain by observing the action effects through continuous features that describe the state of the environment after the execution of each action. Besides, the agent learns its perception function, i.e., a probabilistic mapping between state variables and sensor data represented as a vector of continuous random variables called perception variables. We define an algorithm that updates the planning domain and the perception function by (i) introducing new states, either by extending the possible values of state variables, or by weakening their constraints; (ii) adapts the perception function to fit the observed data (iii) adapts the transition function on the basis of the executed actions and the effects observed via the perception function. The framework is able to deal with exogenous events that happen in the environment.


Computing the Scope of Applicability for Acquired Task Knowledge in Experience-Based Planning Domains

arXiv.org Artificial Intelligence

Experience-based planning domains have been proposed to improve problem solving by learning from experience. They rely on acquiring and using task knowledge, i.e., activity schemata, for generating solutions to problem instances in a class of tasks. Using Three-Valued Logic Analysis (TVLA), we extend previous work to generate a set of conditions that determine the scope of applicability of an activity schema. The inferred scope is a bounded representation of a set of problems of potentially unbounded size, in the form of a 3-valued logical structure, which is used to automatically find an applicable activity schema for solving task problems. We validate this work in two classical planning domains.


How to apply Reinforcement Learning to real life planning problems

#artificialintelligence

To avoid the paper being thrown in the bin we provide this with a large, negative reward, say -1, and because the teacher is pleased with it being placed in the bin this nets a large positive reward, 1. To avoid the outcome where it continually gets passed around the room, we set the reward for all other actions to be a small, negative value, say -0.04. If we set this as a positive or null number then the model may let the paper go round and round as it would be better to gain small positives than risk getting close to the negative outcome. This number is also very small as it will only collect a single terminal reward but it could take many steps to end the episode and we need to ensure that, if the paper is place in the bin, the positive outcome is not cancelled out. Please note: the rewards are always relative to one another and I have chosen arbitrary figures, but these can be changed if the results are not as desired.