Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model Mark Rowland Google DeepMind

Open in new window