Interactive Dialogue Agents via Reinforcement Learning on Hindsight Regenerations