Learning Classical Planning Strategies with Policy Gradient

Gomoluch, Pawel, Alrajeh, Dalal, Russo, Alessandra

Oct-23-2018–arXiv.org Artificial Intelligence

A common paradigm in classical planning is heuristic forward search. Forward search planners often rely on relatively simple best-first search algorithm, which remains fixed throughout the search process. In this paper, we introduce a novel search framework capable of alternating between several forward search approaches while solving a particular planning problem. Selection of the approach is performed using a trainable stochastic policy. This enables tailoring the search strategy to a particular distribution of planning problems and a selected performance metric, such as the IPC score or running time. We construct a strategy space using five search algorithms and a two-dimensional representation of the planner's state. Strategies are then trained on randomly generated planning problems using policy gradient. Experimental results show that the learner is able to discover domain-specific search strategies, thus improving the planner's performance with respect to the chosen metric.

artificial intelligence, planning & scheduling, rw local df, (18 more...)

arXiv.org Artificial Intelligence

Oct-23-2018

arXiv.org PDF

Add feedback

Country:
- North America > United States > California (0.28)

Genre:
- Research Report (0.70)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning
  - Search (1.00)
  - Planning & Scheduling (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found