GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning
