Bandits with Preference Feedback: A Stackelberg Game Perspective Barna Pásztor,1,2 ETH Zurich