Extracting Reward Functions from Diffusion Models
–Neural Information Processing Systems
We consider the problem of extracting a reward function by comparing a decision-making diffusion model that models low-reward behavior and one that models high-reward behavior; a setting related to inverse reinforcement learning. We first define the notion of a relative reward function of two diffusion models and show conditions under which it exists and is unique.
Neural Information Processing Systems
Feb-16-2026, 03:43:02 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- England
- North America > United States
- Illinois > Cook County > Chicago (0.04)
- Asia > Middle East
- Genre:
- Overview (0.46)
- Technology: