Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization

Open in new window