TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models

Open in new window