ProvablyEfficientReinforcementLearningwith LinearFunctionApproximationunderAdaptivity Constraints

Open in new window