GRPO-RM: Fine-Tuning Representation Models via GRPO-Driven Reinforcement Learning