Aligning LLM Agents by Learning Latent Preference from User Edits

Open in new window