Risk-aware Direct Preference Optimization under Nested Risk Measure

Open in new window