We thank the reviewers for their comments and constructive feedback, and we are delighted that they appreciated the
–Neural Information Processing Systems
RL-as-inference (see discussion in Section 4.3), they differ crucially in how the objective is interpreted.
Feb-7-2026, 19:34:23 GMT