Preference-centric Bandits: Optimality of Mixtures and Regret-efficient Algorithms
