An Efficient Dynamic Sampling Policy For Monte Carlo Tree Search

Gongbo Zhang, Yijie Peng, Yilong Xu

arXiv.org Artificial Intelligence 

Monte Carlo Tree Search (MCTS) is a popular tree-based search strategy within the framework of reinforcement learning (RL), which estimates the optimal value of a state and action by building a tree with Monte Carlo simulation. It has been widely used in sequential decision-making, including scheduling, inventory and production management, and real-world games such as Go, Chess, Tic-tac-toe, and Chinese Checkers. See Browne et al. (2012), Fu (2018), and Świechowski et al. (2021) for thorough overviews. MCTS requires little or no domain knowledge and improves itself by running more simulations. Many variants have been proposed to improve its performance. In particular, deep neural networks have been combined with MCTS to achieve remarkable success in the game of Go (Silver et al. 2016, 2017). A basic MCTS builds a game tree from the root node in an incremental and asymmetric manner, where nodes correspond to states and edges correspond to possible state-action pairs.
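To make the incremental tree-building loop concrete, here is a minimal Python sketch of the four canonical MCTS phases (selection, expansion, simulation, backpropagation), with selection done by UCB1 as in the standard UCT variant. The `env` interface with `actions`, `step`, `is_terminal`, and `rollout_reward` is a hypothetical placeholder, not an API from the paper; this is a sketch of the generic algorithm, not of the authors' dynamic sampling policy.

```python
import math
import random

class Node:
    """One node per state; edges to children are state-action pairs."""
    def __init__(self, state, parent=None):
        self.state = state
        self.parent = parent
        self.children = {}       # action -> Node
        self.visits = 0
        self.total_reward = 0.0

def ucb1(child, parent_visits, c=math.sqrt(2)):
    """UCB1 score balancing exploitation (mean reward) and exploration."""
    if child.visits == 0:
        return float("inf")
    return (child.total_reward / child.visits
            + c * math.sqrt(math.log(parent_visits) / child.visits))

def mcts(root_state, env, n_simulations=1000):
    """Assumed interface: env.actions(s), env.step(s, a),
    env.is_terminal(s), env.rollout_reward(s)."""
    root = Node(root_state)
    for _ in range(n_simulations):
        node = root
        # 1. Selection: descend via UCB1 while nodes are fully expanded.
        while (not env.is_terminal(node.state)
               and len(node.children) == len(env.actions(node.state))):
            node = max(node.children.values(),
                       key=lambda ch: ucb1(ch, node.visits))
        # 2. Expansion: add one child for a randomly chosen untried action.
        if not env.is_terminal(node.state):
            untried = [a for a in env.actions(node.state)
                       if a not in node.children]
            action = random.choice(untried)
            child = Node(env.step(node.state, action), parent=node)
            node.children[action] = child
            node = child
        # 3. Simulation: estimate the value by a rollout from the new node.
        reward = env.rollout_reward(node.state)
        # 4. Backpropagation: update statistics along the path to the root.
        while node is not None:
            node.visits += 1
            node.total_reward += reward
            node = node.parent
    # Recommend the most-visited root action.
    return max(root.children, key=lambda a: root.children[a].visits)
```

Returning the most-visited root action, rather than the one with the highest mean reward, is a common robustness choice in the MCTS literature, since visit counts are less noisy than value estimates.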
