Robust Offline Reinforcement Learning with Linearly Structured $f$-Divergence Regularization
