SDPO: Segment-Level Direct Preference Optimization for Social Agents
