A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Open in new window