BaNEL: Exploration Posteriors for Generative Modeling Using Only Negative Rewards
