Communication-Efficient Federated Learning through Adaptive Weight Clustering and Server-Side Distillation