Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Neural Information Processing Systems 

RL is essentially distinct from and probably harder than that in standard offline RL.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found