Window-based Model Averaging Improves Generalization in Heterogeneous Federated Learning