Normalization Is All You Need: Understanding Layer-Normalized Federated Learning under Extreme Label Shift
