Goto

Collaborating Authors

 momentum









57d8ebf4c2f050a6485f370d47656a9e-Supplemental-Conference.pdf

Neural Information Processing Systems

In this section, we report the hyperparameters of each base model used in our paper, details in Table 2. The only hyperparameter that is tuned is done per dataset using a 10% validation split. In this Section, we discuss the experimental convergence of our U-DIF algorithm to the global optimum. In order to approximately compute the true global optimum, we use the following numerical scheme. (exact numbers vary by network and are given in Figure 4).


Fixed-Distance Hamiltonian Monte Carlo

Neural Information Processing Systems

Markov chain Monte Carlo (MCMC) is an inference mechanism that approximates a target probability distribution by a sequence of states (a.k.a.