Scaling Physical Reasoning with the PHYSICS Dataset