Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations