RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold
Neural Information Processing Systems
Nov-17-2025, 12:15:39 GMT
- Country:
- Asia
- China > Guangxi Province
- Nanning (0.04)
- Middle East > Jordan (0.04)
- North America > United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Virginia (0.04)
- Washington > King County
- Seattle (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Workflow (1.00)
- Industry:
- Education (0.46)
- Information Technology (0.46)
- Technology: