sponse addressing one common point raised by Reviewer 1 and Reviewer 3 regarding how to handle the case where 2 null
–Neural Information Processing Systems
We thank all the reviewers for their careful feedback and will revise our paper accordingly. Such a fact is presented in the classic paper "An analysis of temporal-difference learning with function Similar facts can be found for other TD algorithms (e.g. Reviewer 1 is correct in that a discount factor is needed. Now we address specific reviewer comments below. A reference for this is the classic paper "An Finally, the "-" sign in Line 213 is due to the Hurwtiz assumption.
Neural Information Processing Systems
Aug-20-2025, 06:29:57 GMT
- Technology: