Unsupervised clustering of file dialects according to monotonic decompositions of mixtures