Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators
Neural Information Processing Systems
We provide theoretical guarantees for CBPR's rapid convergence to the optimal policy once human partners alter their policies.
Oct-10-2025, 09:29:41 GMT
- Country:
- Asia
- China > Shaanxi Province
- Xi'an (0.04)
- Middle East > Jordan (0.04)
- North America > United States
- Indiana > St. Joseph County
- Notre Dame (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: