Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases
