On Query-efficient Planning in MDPs under Linear Realizability of the Optimal State-value Function
