Are sample means in multi-armed bandits positively or negatively biased?

Open in new window