RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks
