An Adaptive Approach for Infinitely Many-armed Bandits under Generalized Rotting Constraints
Neural Information Processing Systems
In this study, we consider the infinitely many-armed bandit problem in a rested rotting setting, where the mean reward of an arm may decrease each time that arm is pulled and otherwise remains unchanged.
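To make the rested rotting setting concrete, here is a minimal Python sketch of it. It is not the paper's algorithm or its reward model: the class name `RestedRottingArm`, the fixed per-pull decrement `rot`, the noiseless rewards, and the initial means are all assumptions chosen for illustration. The only property taken from the description above is that an arm's mean can drop when that arm is pulled and stays fixed otherwise.

```python
from dataclasses import dataclass


@dataclass
class RestedRottingArm:
    """One arm whose mean reward changes only when the arm itself is pulled."""

    mean: float         # current mean reward (illustrative initial value)
    rot: float = 0.25   # assumed fixed per-pull decrement (hypothetical)

    def pull(self) -> float:
        reward = self.mean     # noiseless reward, to keep the sketch simple
        self.mean -= self.rot  # rotting is triggered only by a pull
        return reward


# Two arms from a (conceptually infinite) pool; only arm_a is pulled.
arm_a = RestedRottingArm(mean=1.0)
arm_b = RestedRottingArm(mean=0.5)

rewards_a = [arm_a.pull() for _ in range(3)]
```

After the three pulls, the mean of `arm_a` has dropped while the mean of `arm_b`, which was never pulled, is unchanged. This is what distinguishes the rested setting from a restless one, where means would drift regardless of pulls.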
May-28-2025, 11:51:31 GMT
- Country:
- Europe > United Kingdom > England (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.87)
- Technology: