R1/R3: Running time and practicality of ApproPO: In our experiments, we implement an RL oracle by a policy-2