Near-Optimal Distributionally Robust Reinforcement Learning with General L Norms Pierre Clavier