Structure of the supplementary material

Aug-16-2025, 03:23:11 GMT–Neural Information Processing Systems

Appendix B provides the proofs for the results of the basic setting presented in Section 3. Appendix C provides the proofs and additional discussion for the results of the concave-convex setting presented in Section 4. Appendix F provides auxiliary concentration lemmas used in the derivation of our results. RL, is presented in Algorithm 1. In this setting, unlike the basic setting, the objective and constraints are not linear. As before, expressing this program in terms of occupation measures yields a convex program. We define the bonus-enhanced cMDP, i.e.
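The remark that the occupation-measure form of the program is convex can be illustrated with a short sketch. The notation below is assumed for illustration and is not taken from the paper: a finite horizon H, state and action sets S and A, transitions p_h, an initial distribution \mu, rewards r, resource consumptions c, a concave function f and a convex function g. The paper's own definitions may differ.

```latex
% Assumed notation (hypothetical, for illustration only):
% q_h(s,a) is the probability of visiting (s,a) at step h.
\begin{align*}
\mathcal{Q}(p) = \Big\{ q \ge 0 :\;
  & \textstyle\sum_{a} q_1(s,a) = \mu(s) \quad \forall s, \\
  & \textstyle\sum_{a} q_{h+1}(s',a) = \sum_{s,a} p_h(s' \mid s,a)\, q_h(s,a)
    \quad \forall s',\; h = 1,\dots,H-1 \Big\}
\end{align*}
% Every constraint defining Q(p) is linear in q, so Q(p) is a polytope.
\begin{align*}
\max_{q \in \mathcal{Q}(p)} \; f\big(\langle r, q \rangle\big)
\quad \text{s.t.} \quad g\big(\langle c, q \rangle\big) \le 0,
\qquad
\langle r, q \rangle = \sum_{h=1}^{H} \sum_{s,a} r_h(s,a)\, q_h(s,a)
\end{align*}
```

Under these assumptions, the objective is a concave function composed with a linear map of q, and the constraint is a convex function composed with a linear map of q, so the program maximizes a concave function over a convex set. When f and g are the identity, it reduces to a linear program, which corresponds to the linear case of the basic setting.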



