nrpa
- Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.96)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.74)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.71)
Stabilized Nested Rollout Policy Adaptation
Cazenave, Tristan, Sevestre, Jean-Baptiste, Toulemont, Matthieu
Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to modify NRPA in order to improve the stability of the algorithm. Experiments show it improves the algorithm for different application domains: SameGame, Traveling Salesman with Time Windows and Expression Discovery.
- Oceania > Australia > Victoria > Melbourne (0.04)
- North America > United States > Arizona > Maricopa County > Phoenix (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- (7 more...)
- Transportation (0.47)
- Leisure & Entertainment > Games (0.47)
Generalized Nested Rollout Policy Adaptation
Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to generalize NRPA with a temperature and a bias and to analyze theoretically the algorithms. The generalized algorithm is named GNRPA. Experiments show it improves on NRPA for different application domains: SameGame and the Traveling Salesman Problem with Time Windows.
- North America > United States > Arizona > Maricopa County > Phoenix (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Spain > Galicia > Madrid (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.50)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (0.36)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.35)
Nested Rollout Policy Adaptation for Monte Carlo Tree Search
Rosin, Christopher D. (Parity Computing, Inc.)
Monte Carlo tree search (MCTS) methods have had recent success in games, planning, and optimization. MCTS uses results from rollouts to guide search; a rollout is a path that descends the tree with a randomized decision at each ply until reaching a leaf. MCTS results can be strongly influenced by the choice of appropriate policy to bias the rollouts. Most previous work on MCTS uses static uniform random or domain-specific policies. We describe a new MCTS method that dynamically adapts the rollout policy during search, in deterministic optimization problems. Our starting point is Cazenave's original Nested Monte Carlo Search (NMCS), but rather than navigating the tree directly we instead use gradient ascent on the rollout policy at each level of the nested search. We benchmark this new Nested Rollout Policy Adaptation (NRPA) algorithm and examine its behavior. Our test problems are instances of Crossword Puzzle Construction and Morpion Solitaire. Over moderate time scales NRPA can substantially improve search efficiency compared to NMCS, and over longer time scales NRPA improves upon all previous published solutions for the test problems. Results include a new Morpion Solitaire solution that improves upon the previous human-generated record that had stood for over 30 years.
- North America > United States > Idaho > Ada County > Boise (0.04)
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- (2 more...)