Inverse-RLignment: Inverse Reinforcement Learning from Demonstrations for LLM Alignment

Open in new window