Decentralized Randomly Distributed Multi-agent Multi-armed Bandit with Heterogeneous Rewards
–Neural Information Processing Systems
Each client pulls an arm and communicates with neighbors based on the graph provided by the environment.
Neural Information Processing Systems
Feb-17-2026, 19:40:18 GMT
- Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- Technology: