Direct Alignment of Language Models via Quality-Aware Self-Refinement
