Relaxing the Identically Distributed Assumption in Gaussian Co-Clustering for High Dimensional Data

Gallaugher, M. P. B., Biernacki, C., McNicholas, P. D.

arXiv.org Machine Learning 

A co-clustering model for continuous data that relaxes the identically distributed assumption within blocks of traditional co-clustering is presented. The proposed model, although allowing more flexibility, still maintains the very high degree of parsimony achieved by traditional co-clustering. A stochastic EM algorithm along with a Gibbs sampler is used for parameter estimation and an ICL criterion is used for model selection. Simulated and real datasets are used for illustration and comparison with traditional co-clustering. Clustering is the process of finding and analyzing underlying group structures in heterogenous data.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found