The Importance of Pessimism in Fixed-Dataset Policy Optimization
