Near Instance-Optimal PAC Reinforcement Learning for Deterministic MDPs

Neural Information Processing Systems 

While minimax optimal algorithms exist for this problem, its instance-dependent complexity remains elusive in episodic Markov decision processes (MDPs).
