Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Open in new window