BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback

Open in new window