Chain of Hindsight Aligns Language Models with Feedback

Open in new window