AlphaGo Zero demystified

Aug-20-2018, 17:25:47 GMT–#artificialintelligence

DeepMind has shaken the world of Reinforcement Learning and Go with its creation AlphaGo, and later AlphaGo Zero. It is the first computer program to beat a human professional Go player without handicap on a 19 x 19 board. It has also beaten the world champion Lee Sedol 4 games to 1, Ke Jie (number one world ranked player at the time) and many other top ranked players with the Zero version. The game of Go is a difficult environment because of its very large branching factor at every move which makes classical techniques such as alpha-beta pruning and heuristic search unrealistic. I will present my work on reproducing the paper as closely as I could. This article will again require background knowledge in Machine Learning and Python, as I will make references to my own implementation.

artificial intelligence, deep learning, machine learning, (16 more...)

#artificialintelligence

Aug-20-2018, 17:25:47 GMT

News Web Page

Add feedback

Country:
- North America > United States (0.05)

Industry:
- Leisure & Entertainment > Games > Go (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Search (0.88)
  - Games > Go (0.84)
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)