DPMM-CFL: Clustered Federated Learning via Dirichlet Process Mixture Model Nonparametric Clustering

Jaramillo-Civill, Mariona, Wu, Peng, Closas, Pau

Oct-9-2025–arXiv.org Machine Learning

Clustered Federated Learning (CFL) improves performance under non-IID client heterogeneity by clustering clients and training one model per cluster, thereby balancing between a global model and fully personalized models. However, most CFL methods require the number of clusters K to be fixed a priori, which is impractical when the latent structure is unknown. We propose DPMM-CFL, a CFL algorithm that places a Dirichlet Process (DP) prior over the distribution of cluster parameters. This enables nonparametric Bayesian inference to jointly infer both the number of clusters and client assignments, while optimizing per-cluster federated objectives. This results in a method where, at each round, federated updates and cluster inferences are coupled, as presented in this paper. The algorithm is validated on benchmark datasets under Dirichlet and class-split non-IID partitions.

assignment, clustered federated learning, federated learning, (12 more...)

arXiv.org Machine Learning

Oct-9-2025

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia (0.04)
  - Massachusetts
    - Suffolk County > Boston (0.04)
    - Middlesex County > Cambridge (0.04)
  - California > San Diego County
    - San Diego (0.04)
- Asia > India
  - Telangana > Hyderabad (0.04)

Genre:
- Research Report (0.40)

Industry:
- Health & Medicine (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.68)
  - Machine Learning
    - Statistical Learning > Clustering (0.73)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found