APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation

Open in new window