CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay

Open in new window