Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision

Open in new window