Near-Optimal Distributionally Robust Reinforcement Learning with General L_p Norms