Iterative Tilting for Diffusion Fine-Tuning

Pachebat, Jean, Conforti, Giovanni, Durmus, Alain, Janati, Yazid

Dec-4-2025–arXiv.org Machine Learning

We introduce iterative tilting, a gradient-free method for fine-tuning diffusion models toward reward-tilted distributions. The method decomposes a large reward tilt $\exp(λr)$ into $N$ sequential smaller tilts, each admitting a tractable score update via first-order Taylor expansion. This requires only forward evaluations of the reward function and avoids backpropagating through sampling chains. We validate on a two-dimensional Gaussian mixture with linear reward, where the exact tilted distribution is available in closed form.

exp, gaussian, tilted distribution, (15 more...)

arXiv.org Machine Learning

Dec-4-2025

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.83)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found