Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination
–Neural Information Processing Systems
This paper investigates the problem of best arm identification in contaminated stochastic multi-arm bandits.
Neural Information Processing Systems
Nov-14-2025, 03:12:45 GMT
- Country:
- Asia
- Japan > Kyūshū & Okinawa
- Okinawa (0.04)
- Middle East > Israel
- Haifa District > Haifa (0.04)
- Japan > Kyūshū & Okinawa
- Europe
- Netherlands > North Holland
- Amsterdam (0.04)
- Portugal > Porto
- Porto (0.04)
- Spain > Canary Islands (0.04)
- Netherlands > North Holland
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- Arizona > Maricopa County
- Phoenix (0.04)
- California > Los Angeles County
- Los Angeles (0.14)
- Wisconsin > Dane County
- Madison (0.04)
- Arizona > Maricopa County
- Canada > Ontario
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Technology: