Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup

Open in new window