Finite-TimeRegretofThompsonSampling AlgorithmsforExponentialFamilyMulti-Armed Bandits

Open in new window