AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
