How to craft a deep reinforcement learning policy for wind farm flow control