Efficient Cluster Selection for Personalized Federated Learning: A Multi-Armed Bandit Approach