Model-BasedMulti-AgentRLinZero-SumMarkov GameswithNear-OptimalSampleComplexity
–Neural Information Processing Systems
Model-based reinforcement learning (RL), which finds an optimal policy using anempirical model, has long been recognized asone ofthe cornerstones ofRL.
Neural Information Processing Systems
Feb-7-2026, 11:13:29 GMT
- Country:
- Europe > United Kingdom
- England (0.04)
- North America
- Canada > British Columbia
- United States > Illinois (0.04)
- Europe > United Kingdom
- Technology: