's interpretation, we'll first contrast our work with Gelada's
–Neural Information Processing Systems
We thank all reviewers for their time and comments. Here are some general responses, followed by individual ones. ... ACE, it would just be an actor-critic analogue of Gelada's Q-learning approach as ... This has not been done in RL and cannot be handled by ACE. We will include a comparison with TD3 in the next version of the paper, as shown in Figure 1. Somewhat surprisingly, TD3 does not outperform DDPG in our setup.
Nov-15-2025, 20:32:59 GMT