A Bayesian Nonparametrics View into Deep Representations: Supplementary material

Neural Information Processing Systems

A Collapsed Gibbs Sampling for DP-GMM

Here we describe collapsed Gibbs sampling (CGS) in more detail. [...] Eqn. 10 we obtain: [...] The expression under the last integral in Eqn. 13 is tractable thanks to the conjugacy of the Normal-inverse-Wishart prior to the Gaussian likelihood. Finally, the posterior predictive density (10) can be written as a mixture of multivariate Student's t-distributions.

CIFAR experiments used the standard train/test split.

Results for architectures not included in Section 4 are summarized in Fig. C.1.

Table C.1: CNN architectures used in experiments (Section 4).
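
The Student's t form of the posterior predictive mentioned in the CGS description follows from Normal-inverse-Wishart conjugacy. The sketch below illustrates it for a single mixture component, using the textbook conjugate update. The function names and the hyperparameter names (`mu0`, `kappa0`, `nu0`, `Psi0`) are assumptions made for illustration and may not match the paper's parametrization or Eqns. 10 and 13.

```python
import math
import numpy as np


def niw_posterior(X, mu0, kappa0, nu0, Psi0):
    """Textbook NIW posterior update for the points X assigned to one component.

    Hypothetical helper: hyperparameter names are assumptions, not the paper's.
    """
    mu0 = np.asarray(mu0, dtype=float)
    Psi0 = np.asarray(Psi0, dtype=float)
    d = mu0.shape[0]
    X = np.asarray(X, dtype=float).reshape(-1, d)
    n = X.shape[0]
    if n == 0:
        # Empty component: the posterior equals the prior.
        return mu0, kappa0, nu0, Psi0
    xbar = X.mean(axis=0)
    centered = X - xbar
    S = centered.T @ centered  # scatter matrix around the sample mean
    kappa_n = kappa0 + n
    nu_n = nu0 + n
    mu_n = (kappa0 * mu0 + n * xbar) / kappa_n
    diff = xbar - mu0
    Psi_n = Psi0 + S + (kappa0 * n / kappa_n) * np.outer(diff, diff)
    return mu_n, kappa_n, nu_n, Psi_n


def student_t_logpdf(x, loc, shape, df):
    """Log-density of a multivariate Student's t with scale matrix `shape`."""
    d = loc.shape[0]
    diff = np.asarray(x, dtype=float) - loc
    _, logdet = np.linalg.slogdet(shape)
    maha = diff @ np.linalg.solve(shape, diff)  # squared Mahalanobis distance
    return (
        math.lgamma((df + d) / 2.0)
        - math.lgamma(df / 2.0)
        - 0.5 * d * math.log(df * math.pi)
        - 0.5 * logdet
        - 0.5 * (df + d) * math.log1p(maha / df)
    )


def predictive_logpdf(x, X, mu0, kappa0, nu0, Psi0):
    """Log posterior predictive of a new point x given the component's points X."""
    mu_n, kappa_n, nu_n, Psi_n = niw_posterior(X, mu0, kappa0, nu0, Psi0)
    d = mu_n.shape[0]
    df = nu_n - d + 1
    shape = Psi_n * (kappa_n + 1.0) / (kappa_n * df)
    return student_t_logpdf(x, mu_n, shape, df)
```

In a collapsed Gibbs sweep, a quantity of this kind would typically be evaluated for each existing component (and for an empty one, via the prior) when resampling a point's assignment. How the paper weights and combines these terms is given by its own equations, which are not reproduced here.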
