Expected Improvement for Contextual Bandits

Neural Information Processing Systems 

We propose two novel algorithms based on expected improvement (EI): one for the case where the reward function is assumed to be linear, and one for more general reward functions.
