AITopics | Planning & Scheduling

Collaborating Authors

Planning & Scheduling

"Planning is the process of generating (possibly partial) representations of future behavior prior to the use of such plans to constrain or control that behavior. The outcome is usually a set of actions, with temporal and other constraints on them, for execution by some agent or agents. As a core aspect of human intelligence, planning has been studied since the earliest days of AI and cognitive science. Planning research has led to many useful tools for real-world applications, and has yielded significant insights into the organization of behavior and the nature of reasoning about actions."
– Planning entry by Austin Tate in the MIT Encyclopedia of Cognitive Science.

News Overviews Instructional Materials AI-Alerts Classics

DESPOT: Online POMDP Planning with Regularization

Somani, Adhiraj, Ye, Nan, Hsu, David, Lee, Wee Sun

Neural Information Processing SystemsDec-31-2013

POMDPs provide a principled framework for planning under uncertainty, but are computationally intractable, due to the “curse of dimensionality” and the “curse of history”. This paper presents an online lookahead search algorithm that alleviates these difficulties by limiting the search to a set of sampled scenarios. The execution of all policies on the sampled scenarios is summarized using a Determinized Sparse Partially Observable Tree (DESPOT), which is a sparsely sampled belief tree. Our algorithm, named Regularized DESPOT (R-DESPOT), searches the DESPOT for a policy that optimally balances the size of the policy and the accuracy on its value estimate obtained through sampling. We give an output-sensitive performance bound for all policies derived from the DESPOT, and show that R-DESPOT works well if a small optimal policy exists. We also give an anytime approximation to R-DESPOT. Experiments show strong results, compared with two of the fastest online POMDP algorithms.

artificial intelligence, machine learning, planning & scheduling, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Add feedback

Bayesian Mixture Modelling and Inference based Thompson Sampling in Monte-Carlo Tree Search

Bai, Aijun, Wu, Feng, Chen, Xiaoping

Neural Information Processing SystemsDec-31-2013

Monte-Carlo tree search is drawing great interest in the domain of planning under uncertainty, particularly when little or no domain knowledge is available. One of the central problems is the trade-off between exploration and exploitation. In this paper we present a novel Bayesian mixture modelling and inference based Thompson sampling approach to addressing this dilemma. The proposed Dirichlet-NormalGamma MCTS (DNG-MCTS) algorithm represents the uncertainty of the accumulated reward for actions in the MCTS search tree as a mixture of Normal distributions and inferences on it in Bayesian settings by choosing conjugate priors in the form of combinations of Dirichlet and NormalGamma distributions. Thompson sampling is used to select the best action at each decision node. Experimental results show that our proposed algorithm has achieved the state-of-the-art comparing with popular UCT algorithm in the context of online planning for general Markov decision processes.

artificial intelligence, bayesian inference, machine learning, (19 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.34)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)
(2 more...)

Add feedback

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

Guez, Arthur, Silver, David, Dayan, Peter

arXiv.org Artificial IntelligenceDec-18-2013

Bayesian model-based reinforcement learning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, finding the resulting Bayes-optimal policies is notoriously taxing, since the search space becomes enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optimal planning which exploits Monte-Carlo tree search. Our approach outperformed prior Bayesian model-based RL algorithms by a significant margin on several well-known benchmark problems -- because it avoids expensive applications of Bayes rule within the search tree by lazily sampling models from the current beliefs. We illustrate the advantages of our approach by showing it working in an infinite state space domain which is qualitatively out of reach of almost all previous work in Bayesian exploration.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

1205.3109

Country: North America (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
(2 more...)

Add feedback

Scalable and Efficient Bayes-Adaptive Reinforcement Learning Based on Monte-Carlo Tree Search

Guez, A., Silver, D., Dayan, P.

Journal of Artificial Intelligence ResearchNov-30-2013

Bayesian planning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, planning optimally in the face of uncertainty is notoriously taxing, since the search space is enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optimal planning which exploits Monte-Carlo tree search. Our approach avoids expensive applications of Bayes rule within the search tree by sampling models from current beliefs, and furthermore performs this sampling in a lazy manner. This enables it to outperform previous Bayesian model-based reinforcement learning algorithms by a significant margin on several well-known benchmark problems. As we show, our approach can even work in problems with an infinite state space that lie qualitatively out of reach of almost all previous work in Bayesian exploration.

artificial intelligence, machine learning, reinforcement learning, (20 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4117

AI Access Foundation

10853

Journal of Artificial Intelligence Research

Country:

Europe (0.14)
North America > United States > Massachusetts (0.14)
North America > Canada > Ontario > Toronto (0.14)

Genre:

Overview (0.46)
Research Report (0.45)

Industry: Energy > Oil & Gas > Upstream (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
(3 more...)

Add feedback

The Complexity of Optimal Monotonic Planning: The Bad, The Good, and The Causal Graph

Domshlak, C., Nazarenko, A.

Journal of Artificial Intelligence ResearchNov-30-2013

For almost two decades, monotonic, or ``delete free,'' relaxation has been one of the key auxiliary tools in the practice of domain-independent deterministic planning. In the particular contexts of both satisficing and optimal planning, it underlies most state-of-the-art heuristic functions. While satisficing planning for monotonic tasks is polynomial-time, optimal planning for monotonic tasks is NP-equivalent. Here we establish both negative and positive results on the complexity of some wide fragments of optimal monotonic planning, with the fragments being defined around the causal graph topology. Our results shed some light on the link between the complexity of general optimal planning and the complexity of optimal planning for the respective monotonic relaxations.

causal graph, fdr task, optimal, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.4145

AI Access Foundation

10852

Journal of Artificial Intelligence Research

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
Asia > Middle East > Israel > Haifa District > Haifa (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)

Add feedback

Generating Believable Stories in Large Domains

Kartal, Bilal (University of Minnesota) | Koenig, John (University of Minnesota) | Guy, Stephen J. (University of Minnesota)

AAAI ConferencesNov-10-2013

Planning-based techniques are a very powerful tool for automated story generation. However, as the number of possible actions increases, traditional planning techniques suffer from a combinatorial explosion due to large branching factors. In this work, we apply Monte Carlo Tree Search (MCTS) techniques to generate stories in domains with large numbers of possible actions (100+). Our approach employs a Bayesian story evaluation method to guide the planning towards believable stories that reach a user defined goal. We generate stories in a novel domain with different type of story goals. Our approach shows an order of magnitude improvement in performance over traditional search techniques.

artificial intelligence, generating believable story, planning & scheduling

AAAI Conferences

Ninth Artificial Intelligence and Interactive Digital Entertainment Conference

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.53)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.53)

Add feedback

Automated Generation of Diverse NPC-Controlling FSMs Using Nondeterministic Planning Techniques

Coman, Alexandra (Ohio Northern University) | Munoz-Avila, Hector (Lehigh University)

AAAI ConferencesNov-10-2013

We study the problem of generating a set of Finite State Machines (FSMs) modeling the behavior of multiple, distinct NPCs. We observe that nondeterministic planning techniques can be used to generate FSMs by following conventions typically used when manually creating FSMs modeling NPC behavior. We implement our ideas in DivNDP, the first algorithm for automated diverse FSM generation.

artificial intelligence, nondeterministic planning technique, planning & scheduling, (2 more...)

AAAI Conferences

Ninth Artificial Intelligence and Interactive Digital Entertainment Conference

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.60)

Add feedback

Integrating Monte Carlo Tree Search with Knowledge-Based Methods to Create Engaging Play in a Commercial Mobile Game

Whitehouse, Daniel (University of York) | Cowling, Peter I. (University of York) | Powley, Edward J. (University of York) | Rollason, Jeff (AI Factory Ltd.)

AAAI ConferencesNov-10-2013

Monte Carlo Tree Search (MCTS) has produced many recent breakthroughs in game AI research, particularly in computer Go. In this paper we consider how MCTS can be applied to create engaging AI for a popular commercial mobile phone game: Spades by AI Factory, which has been downloaded more than 2.5 million times. In particular, we show how MCTS can be integrated with knowledge-based methods to create an interesting, fun and strong player which makes far fewer plays that could be perceived by human observers as blunders than MCTS without the injection of knowledge. These blunders are particularly noticeable for Spades, where a human player must co-operate with an AI partner. MCTS gives objectively stronger play than the knowledge-based approach used in previous versions of the game and offers the flexibility to customise behaviour whilst maintaining a reusable core, with a reduced development cycle compared to purely knowledge-based techniques.

artificial intelligence, expert system, planning & scheduling, (4 more...)

AAAI Conferences

Ninth Artificial Intelligence and Interactive Digital Entertainment Conference

Industry: Leisure & Entertainment > Games (0.53)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.60)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.60)

Add feedback

OnDroad Planner: Building Tourist Plans Using Traveling Social Network Information

Cenamor, Isabel (Universidad Carlos III de Madrid) | Rosa, Tomás de la (Universidad Carlos III de Madrid) | Borrajo, Daniel (Universidad Carlos III de Madrid)

AAAI ConferencesNov-5-2013

One of the key challenges in automated planning is to define the sources of information that will feed the initial state and goals of each planning task. In many domains, the information comes from company's databases. In other applications, the information is harder to obtain and it is usually partial. In this paper, we will describe an application on travel planning, where the initial state and goals will be obtained by crowdsourcing. Travel planning requires the use of plenty Internet-based resources; some of them are related to human generated opinions on all kinds of matters (e.g. hotels, places to visit, restaurants, ...). We present the OnDroad planner, a system that creates personalized tourist plans using the human generated information gathered from the minube traveling social network. OnDroad proposes an initial tourist guide according to the recommendation of the users profiles and their contacts. In addition, this guide can be continuously updated with newly generated data.

building tourist plan, ondroad planner, social network information

AAAI Conferences

First AAAI Conference on Human Computation and Crowdsourcing

Industry: Information Technology > Services (0.60)

Technology:

Information Technology > Communications > Social Media (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.73)

Add feedback

Herding the Crowd: Automated Planning for Crowdsourced Planning

Talamadupula, Kartik (Arizona State University) | Kambhampati, Subbarao (Arizona State University) | Hu, Yuheng (Arizona State University) | Nguyen, Tuan Anh (Arizona State University) | Zhuo, Hankz Hankui (Sun Yat-sen University, Guangzhou, China)

AAAI ConferencesNov-5-2013

An important application of human computation is crowdsourced planning and scheduling. In this paper, we present an architecture for an automated system that can significantly improve the effectiveness of the crowd in collaborating and coming up with effective plans by herding it. We define two main problems that have to be solved when designing such automated crowd-herding systems: interpretation, and steering; and discuss how automated planning techniques can be used to solve these problems.

artificial intelligence, crowdsourced planning, social media, (3 more...)

AAAI Conferences

First AAAI Conference on Human Computation and Crowdsourcing

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Add feedback