TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Open in new window