Few-Round Learning for Federated Learning (Supplementary Material) Younghyun Park

Neural Information Processing Systems 

This latter observation is expected given the different design objectives. Recall that this choice was made because computing the double-derivative terms would have required extra communication bandwidth as well as increased computational load. The number of participating clients is set to 10. Comparison with personalized FL: performance with both unseen and seen classes at deployment. Specifically, we decrease the number of images in each episode from 6000 to 1200 in CIFAR-100, so that each user holds only 120 images.
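The per-user figure is consistent with dividing each episode evenly among the 10 participating clients. The following is a minimal sketch of that arithmetic, assuming an equal, non-overlapping split. The helper name `split_episode`, the shuffling, and the seed are illustrative assumptions and are not taken from the paper.

```python
import random


def split_episode(num_samples, num_clients, seed=0):
    """Evenly partition sample indices among clients.

    Hypothetical helper: the paper only states the totals, so the
    equal, disjoint split used here is an assumption.
    """
    if num_samples % num_clients != 0:
        raise ValueError("num_samples must be divisible by num_clients")
    indices = list(range(num_samples))
    # Shuffle with a local generator so the split is reproducible.
    random.Random(seed).shuffle(indices)
    per_client = num_samples // num_clients
    return [
        indices[i * per_client:(i + 1) * per_client]
        for i in range(num_clients)
    ]


# Original episode size versus the reduced CIFAR-100 setting.
shards_full = split_episode(6000, 10)
shards_reduced = split_episode(1200, 10)
print(len(shards_full[0]), len(shards_reduced[0]))  # prints: 600 120
```

Under this assumption, reducing the episode from 6000 to 1200 images lowers each user's share from 600 to 120 images.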