We thank the reviewers for their time and thorough comments, as well as their valuation of our work including its
–Neural Information Processing Systems
For the larger discussion items, please find the detailed comments below. Additionally, the reviewers highlighted the importance of quantitative fits. We currently attempt to differentiate between these models using additional manipulations. R-learning may be advantageous for computation. Our work builds upon results in the field including Ref [2] This observation enabled us to pursue the hypothesis of the leaky estimate of average reward.
Neural Information Processing Systems
Aug-16-2025, 18:41:01 GMT
- Technology: