Reinforcement Learning for Sequence Design Leveraging Protein Language Models