A Bayesian Nonparametrics View into Deep Representations: Supplementary material

A Collapsed Gibbs Sampling for DP-GMM
–Neural Information Processing Systems
Here we describe CGS in more detail. The expression under the last integral in Eqn. 13, which is obtained from Eqn. 10, is tractable thanks to the conjugacy of the Normal-inverse-Wishart prior to the Gaussian likelihood. Finally, the posterior predictive density (10) can be written as a mixture of multivariate Student's t-distributions.

CIFAR experiments used the standard train/test split.

Results for architectures not included in Section 4 are summarized in Fig. C.1.

Table C.1: CNN architectures used in experiments (Section 4).
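Returning to the collapsed Gibbs sampler of Section A, the conjugacy argument can be illustrated with a minimal sketch. It uses the textbook Normal-inverse-Wishart update and the resulting multivariate Student's t predictive for a single mixture component, combined with Chinese-restaurant-process weights to give the assignment probabilities for one data point. The hyperparameter names (`m0`, `k0`, `v0`, `S0`, `alpha`) and all function names are assumptions made for this illustration; they are not taken from the paper, and the paper's Eqns. 10 to 13 may be parameterized differently.

```python
import math
import numpy as np

def niw_posterior(X, m0, k0, v0, S0):
    """Textbook NIW posterior parameters given the points X (shape n x d)."""
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    if n == 0:                      # empty cluster: posterior equals prior
        return m0, k0, v0, S0
    xbar = X.mean(axis=0)
    centered = X - xbar
    scatter = centered.T @ centered
    kn, vn = k0 + n, v0 + n
    mn = (k0 * m0 + n * xbar) / kn
    diff = (xbar - m0)[:, None]
    Sn = S0 + scatter + (k0 * n / kn) * (diff @ diff.T)
    return mn, kn, vn, Sn

def student_t_logpdf(x, loc, scale, df):
    """Log-density of a multivariate Student's t with scale matrix `scale`."""
    d = len(loc)
    L = np.linalg.cholesky(scale)
    z = np.linalg.solve(L, x - loc)
    maha = float(z @ z)             # squared Mahalanobis distance
    logdet = 2.0 * float(np.sum(np.log(np.diag(L))))
    return (math.lgamma((df + d) / 2) - math.lgamma(df / 2)
            - 0.5 * d * math.log(df * math.pi) - 0.5 * logdet
            - 0.5 * (df + d) * math.log1p(maha / df))

def predictive_logpdf(x, X_k, m0, k0, v0, S0):
    """Posterior predictive of x for one component, with parameters integrated out."""
    mn, kn, vn, Sn = niw_posterior(X_k, m0, k0, v0, S0)
    d = len(m0)
    df = vn - d + 1
    scale = Sn * (kn + 1) / (kn * df)
    return student_t_logpdf(x, mn, scale, df)

def cgs_assignment_probs(x, clusters, alpha, m0, k0, v0, S0):
    """Probabilities of assigning x to each existing cluster or to a new one.

    `clusters` holds the points currently in each cluster, with x itself removed.
    The last entry of the result corresponds to opening a new cluster.
    """
    d = len(m0)
    logw = [math.log(len(Xk)) + predictive_logpdf(x, Xk, m0, k0, v0, S0)
            for Xk in clusters]
    logw.append(math.log(alpha)
                + predictive_logpdf(x, np.empty((0, d)), m0, k0, v0, S0))
    logw = np.array(logw)
    w = np.exp(logw - logw.max())   # subtract the max for numerical stability
    return w / w.sum()

# Illustrative usage with made-up 2-D data (not from the paper).
m0, k0, v0, S0 = np.zeros(2), 0.1, 4.0, np.eye(2)
clusters = [np.array([[0.0, 0.0], [0.2, -0.1], [-0.1, 0.1]]),
            np.array([[5.0, 5.0], [5.2, 4.9], [4.9, 5.1]])]
probs = cgs_assignment_probs(np.array([0.1, 0.0]), clusters, 1.0, m0, k0, v0, S0)
```

A full sampler would loop over all points, remove each from its cluster, draw a new assignment from these probabilities, and drop clusters that become empty. Working in log space avoids underflow when the predictive densities are very small.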