High-ThroughputSynchronousDeepRL

Neural Information Processing Systems 

Deep reinforcement learning (RL) is computationally demanding and requiresprocessing of many data points. Synchronous methods enjoy training stability while having lowerdatathroughput.