Stackelberg Self-Annotation: ARobust Approach to Data-Efficient LLMAlignment

Open in new window