Multi-Agent Learning with Heterogeneous Linear Contextual Bandits
Neural Information Processing Systems
UCB, wherein agents cooperatively minimize the group regret under the coordination of a central server.
Feb-18-2026, 01:20:41 GMT