Reviews: Single-Agent Policy Tree Search With Guarantees

Oct-7-2024, 10:26:43 GMT–Neural Information Processing Systems

The paper presents two planning algorithms based on tree search. Their novelty is that a policy-based criterion, rather than a standard heuristic, guides the search. The first algorithm (LevinTS) uses the cost function d(n)/\pi(n), which ensures that nodes are expanded in best-first order. This allows the authors to derive an upper bound on the number of expansions performed before reaching a target node. The second algorithm (LubyTS) is a randomized algorithm based on sampling trajectories.
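To make the LevinTS ordering concrete, here is a minimal Python sketch of a best-first search that always pops the node with the smallest d(n)/\pi(n). It is an illustration under stated assumptions, not the authors' implementation: the function names (`levin_ts`, `children`, `policy`, `is_goal`), the toy binary tree, and the fixed policy are all hypothetical. The depth is counted from 1 at the root so the cost is never zero, and \pi(n) is taken to be the product of the policy's action probabilities along the path to n. The review does not spell out either convention.

```python
import heapq
from itertools import count


def levin_ts(root, children, policy, is_goal, max_expansions=10_000):
    """Best-first tree search ordered by depth / path probability.

    children(state) -> list of (action, next_state) pairs
    policy(state)   -> dict mapping action -> probability
    Returns (goal_state, expansions), or (None, expansions) if no goal is found.
    """
    tie = count()  # insertion counter so heap ties never compare states
    # Heap entries: (cost, tiebreak, depth, path_probability, state).
    frontier = [(1.0, next(tie), 1, 1.0, root)]
    expansions = 0
    while frontier and expansions < max_expansions:
        _, _, depth, prob, state = heapq.heappop(frontier)
        if is_goal(state):
            return state, expansions
        expansions += 1
        action_probs = policy(state)
        for action, nxt in children(state):
            p = prob * action_probs.get(action, 0.0)
            if p > 0.0:  # zero-probability branches are never reached
                cost = (depth + 1) / p
                heapq.heappush(frontier, (cost, next(tie), depth + 1, p, nxt))
    return None, expansions


# Toy problem (hypothetical): states are strings over {"L", "R"}, up to length 4.
def children(state):
    if len(state) >= 4:
        return []
    return [("L", state + "L"), ("R", state + "R")]


def policy(state):
    # Fixed policy that prefers "L" everywhere.
    return {"L": 0.75, "R": 0.25}


goal, expansions = levin_ts("", children, policy, lambda s: s == "LLR")
print(goal, expansions)
```

Because the frontier is a priority queue keyed on depth divided by path probability, branches the policy considers likely are explored deep before unlikely ones are touched at all, which is the behaviour the expansion bound relies on. LubyTS, being sampling-based, is not covered by this sketch.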

algorithm, expansion, single-agent policy tree search

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning
  - Search (0.73)
  - Planning & Scheduling (0.52)