a1: Steep Test-time Scaling Law via Environment Augmented Generation

Open in new window