Regulatory DNA Sequence Design with Reinforcement Learning