Sample-EfficientReinforcementLearningIsFeasible forLinearlyRealizableMDPswithLimitedRevisiting

Neural Information Processing Systems 

This paper focuses on MDPs with linearly realizable optimal Q-functionQ?.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found