On Monte Carlo Tree Search and Reinforcement Learning

Vodopivec, Tom, Samothrakis, Spyridon, Ster, Branko

Dec-20-2017–Journal of Artificial Intelligence Research

Fuelled by successes in Computer Go, Monte Carlo tree search (MCTS) has achieved widespread adoption within the games community. Its links to traditional reinforcement learning (RL) methods have been outlined in the past; however, the use of RL techniques within tree search has not been thoroughly studied yet. In this paper we re-examine in depth this close relation between the two fields; our goal is to improve the cross-awareness between the two communities. We show that a straightforward adaptation of RL semantics within tree search can lead to a wealth of new algorithms, for which the traditional MCTS is only one of the variants. We confirm that planning methods inspired by RL in conjunction with online search demonstrate encouraging results on several classic board games and in arcade video game competitions, where our algorithm recently ranked first. Our study promotes a unified view of learning, planning, and search.

algorithm, backup, sarsa-uct, (16 more...)

Journal of Artificial Intelligence Research

Dec-20-2017

Journals PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.14)
- North America
  - United States
    - District of Columbia > Washington (0.04)
    - Texas > Travis County
      - Austin (0.04)
    - New York > New York County
      - New York City (0.04)
    - California > San Francisco County
      - San Francisco (0.14)
  - Canada > Alberta
    - Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.04)
- Europe
  - Czechia > Prague (0.04)
  - United Kingdom > England
    - Essex (0.04)
    - Cambridgeshire > Cambridge (0.04)
  - Slovenia > Central Slovenia
    - Municipality of Ljubljana > Ljubljana (0.04)
  - Netherlands > Limburg
    - Maastricht (0.04)
  - Germany
    - North Rhine-Westphalia > Arnsberg Region
      - Dortmund (0.04)
    - Baden-Württemberg > Karlsruhe Region
      - Heidelberg (0.04)
  - Estonia > Tartu County
    - Tartu (0.04)
- Asia > China
  - Beijing > Beijing (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Leisure & Entertainment > Games
  - Computer Games (0.87)
  - Go (0.66)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Search (1.00)
    - Planning & Scheduling (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Learning Graphical Models > Undirected Networks
      - Markov Models (0.45)