On the grid-sampling limit SDE

Bender, Christian, Thuan, Nguyen Tran

arXiv.org Machine Learning 

In our recent work [3] we introduced the grid-sampling SDE as a proxy for modeling exploration in continuous-time reinforcement learning. In this note, we provide further motivation for the use of this SDE and discuss its wellposedness in the presence of jumps.