Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning

Open in new window