Output-Weighted Sampling for Multi-Armed Bandits with Extreme Payoffs

Open in new window