A nonparametric variable clustering model
–Neural Information Processing Systems
Factor analysis models effectively summarise the covariance structure of high dimensional data, but the solutions are typically hard to interpret. This motivates attempting to find a disjoint partition, i.e. a simple clustering, of observed variables into highly correlated subsets. We introduce a Bayesian non-parametric approach to this problem, and demonstrate advantages over heuristic methods proposed to date. Our Dirichlet process variable clustering (DPVC) model can discover blockdiagonal covariance structures in data. We evaluate our method on both synthetic and gene expression analysis problems.
Neural Information Processing Systems
Mar-14-2024, 15:37:25 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California
- Riverside County > Riverside (0.04)
- Santa Clara County > Palo Alto (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California
- Europe > United Kingdom
- Industry: