Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithms Miao Lu1 Han Zhong 2 Tong Zhang 3 Jose Blanchet

Open in new window