[R] Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm • r/MachineLearning

Dec-6-2017, 10:00:38 GMT–@machinelearnbot

One thing I was curious about is whether AlphaZero can play endgames. For example, a friend brought up whether AlphaZero could learn how to play Nim. For anybody who isn't familiar: https://en.wikipedia.org/wiki/Nim, the optimal strategy for Nim involves computing the xor of all the heap sizes. I thought no, largely due to the lack of gradient information/lack of structure/MCTS not being a good heuristic for the quality of the move. However, this game of Nim doesn't seem that different from say, a knight-bishop end game mating scenario for chess.

alphazero, machinelearning, mastering chess and shogi, (3 more...)

@machinelearnbot

Dec-6-2017, 10:00:38 GMT

News Web Page

Add feedback

Industry:
- Leisure & Entertainment > Games > Chess (0.67)

Technology:
- Information Technology
  - Communications > Social Media (1.00)
  - Artificial Intelligence > Machine Learning
    - Reinforcement Learning (0.85)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found