RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs

Open in new window