Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking
Distributed training has been shown to greatly accelerate the training of RL agents with respect to wall clock time (Mnih et al., 2016; Espeholt et al., 2018). Instead of interacting with a single environment at a time, the agent interacts with a set of differently seeded copies of the environment to diversify experience and increase throughput. For simplicity and to ensure reproducibility, Tonic uses a synchronous training loop illustrated in Figure 3.
Nov-20-2020, 07:25:47 GMT
- Technology: