Extracting Reward Functions from Diffusion Models

Neural Information Processing Systems 

We consider the problem of extracting a reward function by comparing two decision-making diffusion models, one that models low-reward behavior and one that models high-reward behavior, a setting related to inverse reinforcement learning. We first define the notion of a relative reward function of two diffusion models and show conditions under which it exists and is unique.
