Generalized Nested Rollout Policy Adaptation with Limited Repetitions

Jan-18-2024, arXiv.org Artificial Intelligence

Generalized Nested Rollout Policy Adaptation (GNRPA) is a Monte Carlo search algorithm for optimizing a sequence of choices. We propose to improve GNRPA by avoiding overly deterministic policies that repeatedly find the same sequence of choices. We do so by limiting the number of repetitions of the best sequence found at a given level. Experiments show that this improves the algorithm on three combinatorial problems: Inverse RNA Folding, the Traveling Salesman Problem with Time Windows, and the Weak Schur problem.
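The idea of limiting repetitions can be illustrated with a minimal sketch. The abstract does not give the exact stopping rule, so everything below is an assumption: the code uses plain NRPA (no bias or temperature terms from the generalized variant), and a level stops iterating once the level below has returned the current best sequence `max_repeats` times. The names `rollout`, `adapt`, `nrpa`, `max_repeats`, and the toy scoring function are hypothetical and are not taken from the paper.

```python
import math
import random


def rollout(policy, length, moves, rng):
    """Sample one sequence using softmax probabilities over policy weights."""
    seq = []
    for step in range(length):
        weights = [math.exp(policy.get((step, m), 0.0)) for m in moves]
        seq.append(rng.choices(moves, weights=weights)[0])
    return seq


def adapt(policy, seq, moves, alpha=1.0):
    """Return a copy of the policy shifted toward seq (standard NRPA update)."""
    new = dict(policy)
    for step, chosen in enumerate(seq):
        weights = [math.exp(policy.get((step, m), 0.0)) for m in moves]
        total = sum(weights)
        for m, w in zip(moves, weights):
            new[(step, m)] = new.get((step, m), 0.0) - alpha * w / total
        new[(step, chosen)] = new.get((step, chosen), 0.0) + alpha
    return new


def nrpa(level, policy, score, length, moves, rng, iterations=10, max_repeats=2):
    """Nested search; a level stops early when its best sequence keeps recurring.

    Assumed rule (not confirmed by the abstract): count how often the lower
    level returns exactly the current best sequence, and break once that
    count reaches max_repeats.
    """
    if level == 0:
        seq = rollout(policy, length, moves, rng)
        return score(seq), seq
    policy = dict(policy)  # each level adapts its own copy
    best_score, best_seq = -math.inf, None
    repeats = 0
    for _ in range(iterations):
        s, seq = nrpa(level - 1, policy, score, length, moves, rng,
                      iterations, max_repeats)
        if seq == best_seq:
            repeats += 1
            if repeats >= max_repeats:
                break  # policy has become too deterministic at this level
        elif s >= best_score:
            best_score, best_seq, repeats = s, seq, 0
        policy = adapt(policy, best_seq, moves)
    return best_score, best_seq


# Toy problem (hypothetical): maximize the number of positions matching TARGET.
TARGET = [2, 0, 1, 2, 0]
MOVES = [0, 1, 2]


def toy_score(seq):
    return sum(1 for a, b in zip(seq, TARGET) if a == b)


if __name__ == "__main__":
    result = nrpa(2, {}, toy_score, len(TARGET), MOVES, random.Random(0))
    print(result)
```

Breaking out of the loop returns control to the level above, which can then adapt its own policy and restart the lower level from a less deterministic state instead of spending the remaining iterations resampling the same sequence.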

algorithm, sequence, Tristan Cazenave

Country:
- Europe
  - Austria > Vienna (0.04)
  - Italy > Piedmont
    - Turin Province > Turin (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)
  - France > Île-de-France
    - Paris > Paris (0.04)

Genre:
- Research Report (0.50)

Industry:
- Leisure & Entertainment > Games (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning (1.00)
  - Games > Go (0.47)
  - Representation & Reasoning
    - Search (0.69)
    - Planning & Scheduling (0.52)