Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
–Neural Information Processing Systems
Text-to-image generation, i.e. generating an image given a text description, is a very challenging task due to the significant semantic gap between the two domains. Humans, however, tackle this problem intelligently. We learn from diverse objects to form a solid prior about semantics, textures, colors, shapes, and layouts. Given a text description, we immediately imagine an overall visual impression using this prior and, based on this, we draw a picture by progressively adding more and more details. In this paper, and inspired by this process, we propose a novel text-to-image method called LeicaGAN to combine the above three phases in a unified framework.
Neural Information Processing Systems
Dec-26-2025, 00:17:42 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (0.59)
- Vision (0.65)
- Information Technology > Artificial Intelligence