The bias of the sample mean in multi-armed bandits can be positive or negative