Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun

Neural Information Processing Systems 

Principle Engraving: In the third stage, we fine-tune the original LLM (the base model) on the self-aligned responses, generated by the LLM itself through prompting, while pruning the principles and demonstrations for the fine-tuned model.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found