Factuality-Aware Alignment for Large Language Models
Neural Information Processing Systems
This makes SFT less factual, since it trains on human-labeled data that may be novel to the LLM. Furthermore, the reward functions used in standard RL often capture factuality inadequately and favor longer, more detailed responses, which inadvertently promotes hallucination.
Oct-10-2025, 17:27:36 GMT
- Country:
  - Europe > Italy
    - Calabria > Catanzaro Province > Catanzaro (0.04)
  - Asia > Middle East
    - Jordan (0.04)
- Genre:
  - Research Report
    - New Finding (1.00)
    - Experimental Study (1.00)
- Industry:
  - Information Technology (0.46)
- Technology: