Near-Optimal Distributionally Robust Reinforcement Learning with General L_p Norms
