Reward-Directed Conditional Diffusion: Provable Distribution Estimation and Reward Improvement

Neural Information Processing Systems

We explore the methodology and theory of reward-directed generation via conditional diffusion models.