Variance reduction of diffusion model's gradients with Taylor approximation-based control variate
Paul Jeha, Will Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes Frellsen
arXiv.org Artificial Intelligence
Score-based models, trained with denoising score matching, are remarkably effective at generating high-dimensional data. However, the high variance of their training objective hinders optimisation. We attempt to reduce it with a control variate, derived via a $k$-th order Taylor expansion of the training objective and its gradient. We prove an equivalence between the two, empirically demonstrate the effectiveness of our approach in a low-dimensional problem setting, and study its effect on larger problems.
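To make the general idea of a Taylor-based control variate concrete, here is a minimal one-dimensional sketch. It is not the paper's method, which applies a $k$-th order expansion to the denoising score matching objective and its gradient. The function `f`, its derivative `df`, the helper `estimates`, and the values of `x0` and `sigma` are all hypothetical choices made for illustration. The sketch estimates the expectation of f(x0 + sigma * eps) for standard normal eps, and subtracts a first-order Taylor expansion whose expectation is known exactly.

```python
import math
import random
import statistics


def f(x):
    # Hypothetical smooth function standing in for a noisy training loss.
    return math.sin(x) + 0.5 * x * x


def df(x):
    # Analytic derivative of f.
    return math.cos(x) + x


def estimates(x0, sigma, n, seed=0):
    """Return per-sample values of the plain and control-variate estimators."""
    rng = random.Random(seed)
    plain, controlled = [], []
    for _ in range(n):
        eps = rng.gauss(0.0, 1.0)
        value = f(x0 + sigma * eps)
        # First-order Taylor expansion of f around x0, evaluated at the noisy point.
        taylor = f(x0) + sigma * eps * df(x0)
        # E[taylor] = f(x0) because E[eps] = 0, so subtracting taylor and
        # adding back f(x0) leaves the expectation unchanged.
        plain.append(value)
        controlled.append(value - taylor + f(x0))
    return plain, controlled


plain, controlled = estimates(x0=1.0, sigma=0.1, n=10_000)
print("plain:     ", statistics.mean(plain), statistics.variance(plain))
print("controlled:", statistics.mean(controlled), statistics.variance(controlled))
```

Both estimators target the same expectation, but the controlled one only carries the second-order and higher remainder of the expansion, so its per-sample variance should be much smaller when `sigma` is small. For larger `sigma` the first-order expansion is a poorer fit and the benefit shrinks, which is one motivation for considering higher-order expansions.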
Aug-22-2024
- Country:
  - North America
    - United States > New York (0.04)
    - Canada > Ontario
      - Toronto (0.04)
  - Europe
    - Austria > Vienna (0.14)
    - Denmark (0.04)
    - Czechia (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
    - Iceland > Capital Region
      - Reykjavik (0.04)
- Genre:
  - Instructional Material (0.72)
  - Research Report > New Finding (0.46)
- Technology: