Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Neural Information Processing Systems 

That is, to minimize simple regret. However, this objective remains understudied.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found