On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

Open in new window