VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation

Open in new window