Why Batch Normalization Damage Federated Learning on Non-IID Data?
