Best Policy Identification in Linear MDPs

Open in new window