Adaptive Choice of Grid and Time in Reinforcement Learning

Pareigis, Stephan

Neural Information Processing Systems 

Consistency problems arise if the discretization needs to be refined, e.g. for more accuracy, application of multi-grid iteration or better starting values for the iteration of the approximate optimal value function. In [7] it was shown, that for diffusion dominated problems, a state to time discretization ratio k/ h of Ch'r, I

Similar Docs  Excel Report  more

TitleSimilaritySource
None found