Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun
–Neural Information Processing Systems
Principle Engraving: In the third stage, we fine-tune the original LLM (the base model) on the self-aligned responses, generated by the LLM itself through prompting, while pruning the principles and demonstrations for the fine-tuned model.
Neural Information Processing Systems
Aug-13-2025, 20:53:40 GMT
- Country:
- North America > United States > California > Los Angeles County (0.67)
- Industry:
- Education (0.68)
- Health & Medicine
- Consumer Health (1.00)
- Therapeutic Area > Psychiatry/Psychology (1.00)
- Energy
- Information Technology > Security & Privacy (1.00)
- Consumer Products & Services (1.00)
- Law > Criminal Law (0.67)
- Government > Regional Government
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Water & Waste Management (0.93)
- Technology: