Reinforcement Learning for Better Verbalized Confidence in Long-Form Generation

Open in new window