Learning the distribution with largest mean: two bandit frameworks

Open in new window