Solving the Inverse Alignment Problem for Efficient RLHF

Open in new window