Learning While Scheduling in Multi-Server Systems with Unknown Statistics: MaxWeight with Discounted UCB