Chain of Hindsight Aligns Language Models with Feedback