On a Connection Between Imitation Learning and RLHF

Open in new window