Measuring the Faithfulness of Thinking Drafts in Large Reasoning Models

Open in new window