On Provable Benefits of Muon in Federated Learning