Relaxing the Identically Distributed Assumption in Gaussian Co-Clustering for High Dimensional Data
Gallaugher, M. P. B., Biernacki, C., McNicholas, P. D.
A co-clustering model for continuous data that relaxes the identically distributed assumption within blocks of traditional co-clustering is presented. The proposed model, although allowing more flexibility, still maintains the very high degree of parsimony achieved by traditional co-clustering. A stochastic EM algorithm along with a Gibbs sampler is used for parameter estimation and an ICL criterion is used for model selection. Simulated and real datasets are used for illustration and comparison with traditional co-clustering. Clustering is the process of finding and analyzing underlying group structures in heterogenous data.
Aug-25-2018
- Country:
- North America
- United States > California
- San Francisco County > San Francisco (0.04)
- Canada > Ontario
- Toronto (0.14)
- United States > California
- North America
- Genre:
- Research Report (0.64)
- Technology: