Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model

Open in new window