Robust Model Evaluation over Large-scale Federated Networks