Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision Zhiqing Sun

Aug-13-2025, 20:53:40 GMT–Neural Information Processing Systems

Principle Engraving: In the third stage, we fine-tune the original LLM (the base model) on the self-aligned responses, generated by the LLM itself through prompting, while pruning the principles and demonstrations for the fine-tuned model.

large language model, machine learning, natural language, (21 more...)

Neural Information Processing Systems

Aug-13-2025, 20:53:40 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > California > Los Angeles County (0.67)

Genre:
- Personal > Interview (0.45)

Industry:
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Consumer Products & Services (1.00)
- Information Technology > Security & Privacy (1.00)
- Water & Waste Management (0.93)
- Education (0.68)
- Law > Criminal Law (0.67)
- Government > Regional Government
  - North America Government > United States Government (0.67)
- Energy
  - Renewable (0.69)
  - Oil & Gas (0.67)
- Health & Medicine
  - Therapeutic Area > Psychiatry/Psychology (1.00)
  - Consumer Health (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Chatbot (0.96)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
0764db1151b936aca59249e2c1386101-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found