e562cd9c0768d5464b64cf61da7fc6bb-AuthorFeedback.pdf
Neural Information Processing Systems
We thank the reviewers for their thoughtful comments! We have an example in Table 6 in Supplement D.1: in some cases, (e.g. ... As with any learning algorithm, one has to be careful of extrapolation. ... ODE, then we could absolutely use RL to learn the parameters of that ODE. Using the learned dynamics models for planning (e.g., Dyna-style ... We extended Swimmer to 450k steps below.
Aug-17-2025, 01:01:56 GMT