Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization