Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge

Dec-26-2025, 00:17:42 GMT–Neural Information Processing Systems

Text-to-image generation, i.e. generating an image given a text description, is a very challenging task due to the significant semantic gap between the two domains. Humans, however, tackle this problem intelligently. We learn from diverse objects to form a solid prior about semantics, textures, colors, shapes, and layouts. Given a text description, we immediately imagine an overall visual impression using this prior and, based on this, we draw a picture by progressively adding more and more details. In this paper, and inspired by this process, we propose a novel text-to-image method called LeicaGAN to combine the above three phases in a unified framework.

imagine and create, name change, text-to-image generation, (7 more...)

Neural Information Processing Systems

Dec-26-2025, 00:17:42 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.65)
  - Machine Learning (0.59)