PlanninginMarkovDecisionProcesseswith Gap-DependentSampleComplexity

Neural Information Processing Systems 

This problem-dependent sample complexityresult is expressed in terms of the sub-optimality gapsof the state-action pairs that are visited during exploration.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found