AITopics

Journal of Artificial Intelligence ResearchOct-7-2014

Distributed Heuristic Forward Search for Multi-agent Planning

Nissim, R., Brafman, R.

This paper deals with the problem of classical planning for multiple cooperative agents who have private information about their local state and capabilities they do not want to reveal. Two main approaches have recently been proposed to solve this type of problem -- one is based on reduction to distributed constraint satisfaction, and the other on partial-order planning techniques. In classical single-agent planning, constraint-based and partial-order planning techniques are currently dominated by heuristic forward search. The question arises whether it is possible to formulate a distributed heuristic forward search algorithm for privacy-preserving classical multi-agent planning. Our work provides a positive answer to this question in the form of a general approach to distributed state-space search in which each agent performs only the part of the state expansion relevant to it. The resulting algorithms are simple and efficient -- outperforming previous algorithms by orders of magnitude -- while offering similar flexibility to that of forward-search algorithms for single-agent planning. Furthermore, one particular variant of our general approach yields a distributed version of the A* algorithm that is the first cost-optimal distributed algorithm for privacy-preserving planning.

agent, algorithm, public action, (16 more...)

doi: 10.1613/jair.4295

10909

Country:

North America > United States (0.14)
Asia > Middle East > Israel (0.04)

Industry:

Transportation (1.00)
Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.66)

Bäckström, C., Jonsson, A., Jonsson, P.

Automaton Plans

Journal of Artificial Intelligence ResearchSep-30-2014

Macros have long been used in planning to represent subsequences of operators. Macros can be used in place of individual operators during search, sometimes reducing the effort required to find a plan to the goal. Another use of macros is to compactly represent long plans. In this paper we introduce a novel solution concept called automaton plans in which plans are represented using hierarchies of automata. Automaton plans can be viewed as an extension of macros that enables parameterization and branching. We provide several examples that illustrate how automaton plans can be useful, both as a compact representation of exponentially long plans and as an alternative to sequential solutions in benchmark domains such as Logistics and Grid. We also compare automaton plans to other compact plan representations from the literature, and find that automaton plans are strictly more expressive than macros, but strictly less expressive than HTNs and certain representations allowing efficient sequential access to the operators of the plan.

automaton plan, operator, representation, (12 more...)

doi: 10.1613/jair.4391

10908

Country:

Asia > Vietnam > Hanoi > Hanoi (0.04)
Europe > Sweden > Östergötland County > Linköping (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > Promising Solution (0.48)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Sturtevant, Nathan R. (University of Denver)

The Grid-Based Path Planning Competition

AI MagazineSep-29-2014

While there have been many papers published on path planning in grids, there has not been significant work on comparing existing approaches, and it is difficult to evaluate new work in comparison to existing work. After creating a public repository of grid-based path planning problems we created the grid-based planning competition (GPPC) to facilitate these comparisons. This article describes the motivation and design of the competition, as well as plans for the future of the competition.

artificial intelligence, competition, management and information, (4 more...)

AI Magazine

Genre: Contests & Prizes (0.73)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

AAAI News

Hamilton, Carol (AAAI)

AI MagazineSep-29-2014

Participants Intelligence (AAAI-15) and the Twenty-Seventh Conference in the AAAI-15 Robotics Exhibition and the on Innovative Applications of Artificial Intelligence AAAI-15 Video Competition are encouraged to contribute (IAAI-15) will be held January 25-29 at the to the Demonstration Program with their systems, Hyatt Regency Austin in Austin, Texas, USA. AAAI is working October 8 (Papers Due) closely with the local AI community to create opportunities The Senior Member Track provides an opportunity for attendees to experience AI in Texas! Attendees for established researchers in the AI community to can also enjoy nearly 200 music venues that feature give a broad talk on a well-developed body of everything from rock and blues to country and research, an important new research area, or a promising jazz every night of the week. Austin cuisine has new topic. This year, new "Blue Sky Ideas" track expanded from barbecue and Tex-Mex to award-winning is seeking presentations aimed at presenting ideas and inventive international cuisine, and blossomed and visions that can stimulate the research community beyond brick-and-mortar restaurants to a to pursue new directions, such as new problems, vibrant, citywide food truck movement.

artificial intelligence, machine learning, natural language, (17 more...)

AI Magazine

Country:

North America > United States > California (1.00)
North America > United States > Texas > Travis County > Austin (0.34)

Genre:

Personal > Honors (1.00)
Instructional Material (0.68)

Industry:

Information Technology (0.94)
Education > Educational Setting > Online (0.93)
Leisure & Entertainment > Games (0.93)
(4 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(3 more...)

Sturtevant, Nathan R. (University of Denver)

The Grid-Based Path Planning Competition

AI MagazineSep-29-2014

artificial intelligence, competition, planning & scheduling, (12 more...)

AI Magazine

Country: North America > United States > California (0.29)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)

Feldman, Z., Domshlak, C.

Simple Regret Optimization in Online Planning for Markov Decision Processes

Journal of Artificial Intelligence ResearchSep-25-2014

We consider online planning in Markov decision processes (MDPs). In online planning, the agent focuses on its current state only, deliberates about the set of possible policies from that state onwards and, when interrupted, uses the outcome of that exploratory deliberation to choose what action to perform next. Formally, the performance of algorithms for online planning is assessed in terms of simple regret, the agent's expected performance loss when the chosen action, rather than an optimal one, is followed. To date, state-of-the-art algorithms for online planning in general MDPs are either best effort, or guarantee only polynomial-rate reduction of simple regret over time. Here we introduce a new Monte-Carlo tree search algorithm, BRUE, that guarantees exponential-rate and smooth reduction of simple regret. At a high level, BRUE is based on a simple yet non-standard state-space sampling scheme, MCTS2e, in which different parts of each sample are dedicated to different exploratory objectives. We further extend BRUE with a variant of ``learning by forgetting.'' The resulting parametrized algorithm, BRUE(alpha), exhibits even more attractive formal guarantees than BRUE. Our empirical evaluation shows that both BRUE and its generalization, BRUE(alpha), are also very effective in practice and compare favorably to the state-of-the-art.

algorithm, brue, simple regret optimization, (13 more...)

doi: 10.1613/jair.4432

10905

Country:

North America > United States > New York (0.04)
North America > United States > California (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(3 more...)

Industry: Leisure & Entertainment > Games (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)

arXiv.org Artificial IntelligenceSep-17-2014

A Tabu Search Algorithm for the Multi-period Inspector Scheduling Problem

Qin, Hu, Zhang, Zizhen, Xie, Yubin, Lim, Andrew

This paper introduces a multi-period inspector scheduling problem (MPISP), which is a new variant of the multi-trip vehicle routing problem with time windows (VRPTW). In the MPISP, each inspector is scheduled to perform a route in a given multi-period planning horizon. At the end of each period, each inspector is not required to return to the depot but has to stay at one of the vertices for recuperation. If the remaining time of the current period is insufficient for an inspector to travel from his/her current vertex $A$ to a certain vertex B, he/she can choose either waiting at vertex A until the start of the next period or traveling to a vertex C that is closer to vertex B. Therefore, the shortest transit time between any vertex pair is affected by the length of the period and the departure time. We first describe an approach of computing the shortest transit time between any pair of vertices with an arbitrary departure time. To solve the MPISP, we then propose several local search operators adapted from classical operators for the VRPTW and integrate them into a tabu search framework. In addition, we present a constrained knapsack model that is able to produce an upper bound for the problem. Finally, we evaluate the effectiveness of our algorithm with extensive experiments based on a set of test instances. Our computational results indicate that our approach generates high-quality solutions.

algorithm, artificial intelligence, planning & scheduling, (16 more...)

arXiv.org Artificial Intelligence

1409.5166

Country:

North America > United States (0.67)
Asia > China > Guangdong Province (0.28)

Genre: Research Report (0.81)

Industry:

Consumer Products & Services > Travel (1.00)
Transportation > Freight & Logistics Services (0.89)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.84)

Journal of Artificial Intelligence ResearchAug-31-2014

Belief Tracking for Planning with Sensing: Width, Complexity and Approximations

Bonet, B., Geffner, H.

We consider the problem of belief tracking in a planning setting where states are valuations over a set of variables that are partially observable, and beliefs stand for the sets of states that are possible. While the problem is intractable in the worst case, it has been recently shown that in deterministic conformant and contingent problems, belief tracking is exponential in a width parameter that is often bounded and small. In this work, we extend these results in two ways. First, we introduce a width notion that applies to non-deterministic problems as well, develop a factored belief tracking algorithm that is exponential in the problem width, and show how it applies to existing benchmarks. Second, we introduce a meaningful, powerful, and sound approximation scheme, beam tracking, that is exponential in a smaller parameter, the problem causal width, and has much broader applicability. We illustrate the value of this algorithm over large instances of problems such as Battleship, Minesweeper, and Wumpus, where it yields state-of-the-art performance in real-time.

artificial intelligence, logic & formal reasoning, machine learning, (19 more...)

doi: 10.1613/jair.4475

10901

Country:

Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > Canada > Ontario > Toronto (0.04)
(15 more...)

Industry: Government > Military > Navy (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Belief Revision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Barreto, A. M. S., Pineau, J., Precup, D.

Policy Iteration Based on Stochastic Factorization

Journal of Artificial Intelligence ResearchAug-20-2014

When a transition probability matrix is represented as the product of two stochastic matrices, one can swap the factors of the multiplication to obtain another transition matrix that retains some fundamental characteristics of the original. Since the derived matrix can be much smaller than its precursor, this property can be exploited to create a compact version of a Markov decision process (MDP), and hence to reduce the computational cost of dynamic programming. Building on this idea, this paper presents an approximate policy iteration algorithm called policy iteration based on stochastic factorization, or PISF for short. In terms of computational complexity, PISF replaces standard policy iteration's cubic dependence on the size of the MDP with a function that grows only linearly with the number of states in the model. The proposed algorithm also enjoys nice theoretical properties: it always terminates after a finite number of iterations and returns a decision policy whose performance only depends on the quality of the stochastic factorization. In particular, if the approximation error in the factorization is sufficiently small, PISF computes the optimal value function of the MDP. The paper also discusses practical ways of factoring an MDP and illustrates the usefulness of the proposed algorithm with an application involving a large-scale decision problem of real economical interest.

algorithm, factorization, mdp, (17 more...)

doi: 10.1613/jair.4301

10897

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
Asia > Middle East > Jordan (0.04)
(2 more...)

Genre:

Research Report (0.46)
Overview (0.46)

Industry: Energy > Power Industry (0.45)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)