Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model

Open in new window