Is Pessimism Provably Efficient for Offline RL?
