Near-optimal Policy Identification in Active Reinforcement Learning

Open in new window