Fine-tuning Flow Matching Generative Models with Intermediate Feedback
Fan, Jiajun, Cheng, Chaoran, Shen, Shuaike, Zhou, Xiangxin, Liu, Ge
arXiv.org Artificial Intelligence
Flow-based generative models have shown remarkable success in text-to-image generation, yet fine-tuning them with intermediate feedback remains challenging, especially for continuous-time flow matching models. Most existing approaches learn solely from outcome rewards and therefore struggle with the credit assignment problem. Alternative methods that attempt to learn a critic via direct regression on cumulative rewards often face training instabilities and model collapse in online settings. We present AC-Flow, a robust actor-critic framework that addresses these challenges through three key innovations: (1) reward shaping that provides well-normalized learning signals to enable stable intermediate value learning and gradient control, (2) a novel dual-stability mechanism that combines advantage clipping to prevent destructive policy updates with a warm-up phase that allows the critic to mature before influencing the actor, and (3) a scalable generalized critic weighting scheme that extends traditional reward-weighted methods while preserving model diversity through Wasserstein regularization. Through extensive experiments on Stable Diffusion 3, we demonstrate that AC-Flow achieves state-of-the-art performance in text-to-image alignment tasks and generalizes to unseen human preference models. Our results show that even with a computationally efficient critic model, we can robustly fine-tune flow models without compromising generative quality, diversity, or stability.
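To make the dual-stability idea in (2) concrete, here is a minimal sketch of how advantage clipping and a critic warm-up phase could combine into per-sample loss weights. It is an illustration under stated assumptions, not the paper's implementation: the function name `critic_weights`, the exponential weighting, and the parameters `warmup_steps`, `clip`, and `beta` are all hypothetical choices made for this example.

```python
import math

def critic_weights(rewards, values, step, warmup_steps=100, clip=1.0, beta=1.0):
    """Per-sample loss weights from clipped advantages (illustrative only).

    rewards, values: equal-length lists of floats (shaped reward and critic estimate).
    All parameter names and defaults are assumptions, not taken from the paper.
    """
    if step < warmup_steps:
        # Warm-up: the critic is still maturing, so it does not influence the actor.
        return [1.0] * len(rewards)
    weights = []
    for r, v in zip(rewards, values):
        adv = r - v                        # advantage: reward minus critic estimate
        adv = max(-clip, min(clip, adv))   # clip to bound the size of any update
        weights.append(math.exp(beta * adv))
    return weights
```

During warm-up every sample receives weight 1, so the actor update reduces to an unweighted objective; afterwards the clipped advantage keeps each weight within `[exp(-beta * clip), exp(beta * clip)]`.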
Oct-22-2025
- Country:
  - Africa
    - Ethiopia > Addis Ababa
      - Addis Ababa (0.04)
    - Rwanda > Kigali
      - Kigali (0.04)
  - Asia
    - Middle East > Jordan (0.04)
    - Singapore (0.04)
  - Europe > Austria
    - Vienna (0.14)
  - North America
    - Canada > Quebec
      - Montreal (0.04)
    - Puerto Rico > San Juan
      - San Juan (0.04)
    - United States
      - Illinois > Champaign County
        - Urbana (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Maryland > Baltimore (0.04)
      - Oregon > Benton County
        - Corvallis (0.04)
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - Washington > King County
        - Seattle (0.04)
- Genre:
  - Research Report > New Finding (0.68)
- Technology: