Supplementary Materials for " Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity " A Proofs of the Main Results

Feb-7-2026, 11:13:36 GMT–Neural Information Processing Systems

We first introduce some additional notations for convenience. Our proof mainly consists of the following steps: 1. Helper lemmas and a crude bound. See A.2, and more precisely, Lemmas A.9 and A.10. 3. Final bound for null -approximate NE value. See A.3. 4. Final bounds for null -NE policy. See A.5. 14 A.1 Important Lemmas We start with the component-wise error bounds.

artificial intelligence, inequality, lemma, (17 more...)

Neural Information Processing Systems

Feb-7-2026, 11:13:36 GMT

Conferences PDF

Add feedback

Country:
- North America > United States (0.04)

Technology:
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.40)

Duplicate Docs Excel Report

Title
Supplementary Materials for " Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity " A Proofs of the Main Results

Similar Docs Excel Report more

Title	Similarity	Source
None found