GoAi #1: Asynchronous Methods for Deep Reinforcement Learning
First, if you don't have a background in deep reinforcement learning, you can think of it as one of the main algorithms behind AlphaGo. To overcome the drawbacks of earlier approaches, the authors propose asynchronous methods for deep reinforcement learning. Using a CPU instead of a GPU, we can start multiple threads, each running its own copy of the same environment, while all of them share the same model weights. After reading the pseudocode, we find that it differs very little from the original DQN algorithm. The special point is the line containing t mod I_AsyncUpdate: each thread accumulates its gradients locally and applies them to the shared weights only every I_AsyncUpdate steps, or when an episode ends.
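To make the t mod I_AsyncUpdate line concrete, here is a minimal sketch of the idea, not the authors' implementation. It assumes a tabular Q-table in place of a neural network, a hypothetical five-state chain environment (`step`), and a lock around the shared update for simplicity (the paper's threads update without locking). The names `Shared`, `worker`, `train` and all constants are made up for illustration.

```python
import random
import threading

N_STATES, N_ACTIONS = 5, 2   # toy chain: action 1 moves right, action 0 moves left
GAMMA, LR, EPSILON = 0.9, 0.1, 0.2
I_ASYNC_UPDATE = 5           # apply accumulated updates every 5 local steps
I_TARGET = 50                # refresh the target copy every 50 global steps


def step(state, action):
    """Hypothetical toy environment: reaching the right end pays reward 1."""
    nxt = min(state + 1, N_STATES - 1) if action == 1 else max(state - 1, 0)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


class Shared:
    """State shared by all threads: weights, target weights, global counter."""

    def __init__(self):
        self.q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]  # shared weights
        self.q_target = [row[:] for row in self.q]             # target weights
        self.T = 0                                             # global step count
        self.lock = threading.Lock()  # simplification; the paper is lock-free


def worker(shared, n_steps, seed):
    rng = random.Random(seed)
    grads = {}        # locally accumulated updates, keyed by (state, action)
    state, t = 0, 0   # t is this thread's local step counter
    for _ in range(n_steps):
        # Epsilon-greedy action from the shared weights, random tie-breaking.
        row = shared.q[state]
        if rng.random() < EPSILON:
            action = rng.randrange(N_ACTIONS)
        else:
            best = max(row)
            action = rng.choice([a for a in range(N_ACTIONS) if row[a] == best])
        nxt, reward, done = step(state, action)
        # One-step Q-learning target, computed from the target weights.
        y = reward if done else reward + GAMMA * max(shared.q_target[nxt])
        td = y - shared.q[state][action]
        grads[(state, action)] = grads.get((state, action), 0.0) + td
        state = 0 if done else nxt
        t += 1
        with shared.lock:
            shared.T += 1
            if shared.T % I_TARGET == 0:
                shared.q_target = [r[:] for r in shared.q]
            # The special line: write to the shared weights only every
            # I_ASYNC_UPDATE local steps, or when the episode has ended.
            if t % I_ASYNC_UPDATE == 0 or done:
                for (s, a), g in grads.items():
                    shared.q[s][a] += LR * g
                grads.clear()


def train(n_threads=4, n_steps=2000):
    shared = Shared()
    threads = [threading.Thread(target=worker, args=(shared, n_steps, seed))
               for seed in range(n_threads)]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return shared


if __name__ == "__main__":
    result = train()
    print("global steps:", result.T)
    for s, row in enumerate(result.q):
        print(s, [round(v, 2) for v in row])
```

Apart from the local accumulation and the periodic write to the shared table, each worker runs ordinary one-step Q-learning, which is why the pseudocode looks so close to DQN.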
Jul-13-2016, 11:55:49 GMT