Decentralized Cooperative Stochastic Bandits
–Neural Information Processing Systems
In the most basic setting of this problem, an agent repeatedly pulls one arm from a finite set of arms (or actions) and receives a reward that depends on the chosen arm.
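The basic setting described above can be illustrated with a minimal sketch. It assumes Bernoulli rewards and an agent that picks arms uniformly at random; both are illustrative assumptions, not the method of the paper, and the names `BernoulliBandit` and `run_uniform_agent` are hypothetical.

```python
import random


class BernoulliBandit:
    """Hypothetical environment: arm k pays 1 with probability means[k], else 0."""

    def __init__(self, means, seed=0):
        self.means = list(means)
        self.rng = random.Random(seed)

    def pull(self, arm):
        # The reward depends only on the chosen arm.
        return 1 if self.rng.random() < self.means[arm] else 0


def run_uniform_agent(bandit, rounds, seed=1):
    """Pull a uniformly random arm each round (a placeholder policy, assumed here)."""
    rng = random.Random(seed)
    n_arms = len(bandit.means)
    counts = [0] * n_arms   # number of pulls per arm
    totals = [0] * n_arms   # cumulative reward per arm
    for _ in range(rounds):
        arm = rng.randrange(n_arms)
        reward = bandit.pull(arm)
        counts[arm] += 1
        totals[arm] += reward
    return counts, totals


if __name__ == "__main__":
    counts, totals = run_uniform_agent(BernoulliBandit([0.2, 0.5, 0.8]), rounds=1000)
    print(counts, totals)
```

A learning agent would replace the uniform choice with a rule that uses `counts` and `totals` to favor arms with higher observed average reward.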
Feb-12-2026, 19:43:06 GMT
- Country:
- Africa > South Sudan
- Equatoria > Central Equatoria > Juba (0.04)
- Europe > Ireland (0.04)
- North America
- Canada > British Columbia
- United States (0.04)