57d8ebf4c2f050a6485f370d47656a9e-Supplemental-Conference.pdf
–Neural Information Processing Systems
In this section, we report the hyperparameters of each base model used in our paper, details in Table 2. The only hyperparameter that is tuned is done per dataset using a 10% validation split. In this Section, we discuss the experimental convergence of our U-DIF algorithm to the global optimum. In order to approximately compute the true global optimum, we use the following numerical scheme. (exact numbers vary by network and are given in Figure 4).
Neural Information Processing Systems
Feb-12-2026, 02:36:28 GMT
- Technology: