Dueling Bandits with Team Comparisons Lee Cohen
–Neural Information Processing Systems
Multi-arm bandits (MAB) is a classical model of decision making under uncertainty. In spite of the simplicity of the model, it already incorporates the essential tradeoff between exploration and exploitation.
Neural Information Processing Systems
Nov-15-2025, 10:14:46 GMT
- Country:
- Asia > Middle East
- Israel > Tel Aviv District > Tel Aviv (0.04)
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States (0.04)
- Asia > Middle East
- Industry:
- Leisure & Entertainment (0.68)
- Technology: