Factuality-Aware Alignment for Large Language Models
–Neural Information Processing Systems
This makes SFT less factual as it trains on human-labeled data that may be novel to the LLM. Furthermore, reward functions used in standard RL often inadequately capture factuality and favor longer and more detailed responses, which inadvertently promote hallucination.
Neural Information Processing Systems
Oct-10-2025, 17:27:36 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > Italy
- Calabria > Catanzaro Province > Catanzaro (0.04)
- Asia > Middle East
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Information Technology (0.46)
- Technology: