KTO: Model Alignment as Prospect Theoretic Optimization

Open in new window