Personalized Reinforcement Learning with a Budget of Policies