about the assumptions, related work, and evaluation. CONTENT
–Neural Information Processing Systems
We thank all reviewers for their valuable time and feedback. Note that multiple recent works (offline and online) simply assume a linear MDP with known features in analysis. KL-divergence formulation to impose different distribution priors when available. R2, thank you very much for the suggestions. We agree about Section 3.3 and in retrospect should have saved We will remove it and move some of the Appendix into the paper.
Neural Information Processing Systems
May-30-2025, 04:51:15 GMT
- Technology: