An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs
