Expected Improvement for Contextual Bandits
Neural Information Processing Systems
We propose two novel algorithms based on expected improvement (EI): one for the case where the reward function is assumed to be linear, and the other for more general reward functions.
Aug-16-2025, 23:39:27 GMT
- Country:
- North America > United States
- Wisconsin > Dane County
- Madison (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Florida > Broward County
- Fort Lauderdale (0.04)
- Europe
- United Kingdom
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Cambridgeshire
- Cambridge (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- Asia
- Russia > Siberian Federal District
- Novosibirsk Oblast > Novosibirsk (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Genre:
- Research Report (0.46)
- Technology: