Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
