Simple Regret Optimization in Online Planning for Markov Decision Processes

Open in new window