Aligning Diffusion Models by Optimizing Human Utility
–Neural Information Processing Systems
We present Diffusion-KTO, a novel approach for aligning text-to-image diffusion models by formulating the alignment objective as the maximization of expected human utility. Unlike previous methods, Diffusion-KTO does not require collecting pairwise preference data nor training a complex reward model. Instead, our objective uses per-image binary feedback signals, e.g.
Neural Information Processing Systems
Dec-24-2025, 16:21:51 GMT
- Technology: