eba237eccc24353ccaa4d62013556ac6-AuthorFeedback.pdf
Neural Information Processing Systems
We thank all reviewers for their time and appreciate the thoughtful feedback. Below, we address the main comments.

"In the example given by the author, the agent is allowed to run until it reaches a terminal state during [...]"
We understand why this would be a concern, but it is actually not what we do. On the topic of terminal states, note that we have not explicitly defined any terminal states for the tasks from Figure 1. We will clarify this point further in the paper.

"Their approach was marginally better than DQN on most Atari games [...] it would be nice to see some [...]"
We hope that our clarification of the Figure 1 plots has increased your appreciation of low discount factors.
Aug-20-2025, 08:18:40 GMT