FastPureExplorationviaFrank-Wolfe

Neural Information Processing Systems 

ConsiderK arms whose reward distributions (ν1,...,νK) come from a one-dimensional exponential family and are of unknown means µ=(µ1,...,µK).

Similar Docs  Excel Report  more

TitleSimilaritySource
None found