Pure exploration in multi-armed bandits with low rank structure using oblivious sampler
