Adaptive Choice of Grid and Time in Reinforcement Learning
–Neural Information Processing Systems
Consistency problems arise if the discretization needs to be refined, e.g. for more accuracy, application of multi-grid iteration or better starting values for the iteration of the approximate optimal value function. In [7] it was shown, that for diffusion dominated problems, a state to time discretization ratio k/ h of Ch'r, I
Neural Information Processing Systems
Dec-31-1998
- Technology: