Improving Model and Search for Computer Go

Feb-5-2021–arXiv.org Artificial Intelligence

The standard for Deep Reinforcement Learning in games, following AlphaZero, is to use residual networks and to increase the depth of the network to get better results. We propose improved mobile networks as an alternative to residual networks and experimentally measure the playing strength of the networks as a function of both their width and their depth. We also propose a generalization of the PUCT search algorithm that improves on standard PUCT.
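
For readers unfamiliar with PUCT, the following is a minimal sketch of the standard selection rule as used in AlphaZero-style search, which the abstract takes as its baseline. It does not show the generalization proposed in the paper, which is not described here. The function name `puct_select`, the default constant `c_puct=1.5`, and the choice of a zero value for unvisited children are assumptions made for illustration only.

```python
import math


def puct_select(priors, visit_counts, value_sums, c_puct=1.5):
    """Return the index of the child with the highest standard PUCT score.

    priors       -- policy-network prior P(s, a) for each child
    visit_counts -- visit count N(s, a) for each child
    value_sums   -- sum of backed-up values W(s, a) for each child
    c_puct       -- exploration constant (hypothetical default)
    """
    total_visits = sum(visit_counts)
    best_index, best_score = 0, -math.inf
    for i, (p, n, w) in enumerate(zip(priors, visit_counts, value_sums)):
        # Mean action value Q(s, a); unvisited children get 0 (an assumption).
        q = w / n if n > 0 else 0.0
        # Exploration bonus: large for high-prior, rarely visited children.
        u = c_puct * p * math.sqrt(total_visits) / (1 + n)
        score = q + u
        if score > best_score:
            best_index, best_score = i, score
    return best_index
```

For example, `puct_select([0.5, 0.3, 0.2], [10, 1, 0], [5.0, 0.9, 0.0])` picks the second child: it combines a high mean value with a low visit count, so both terms of the score favour it.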

Country:
- Europe
  - France (0.14)
  - Italy > Piedmont
    - Turin Province > Turin (0.04)

Genre:
- Research Report (0.82)

Industry:
- Leisure & Entertainment > Games > Go (0.70)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (1.00)
  - Games > Go (1.00)
