Memory-Driven Text-to-Image Generation