Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
Qiao, Tingting, Zhang, Jing, Xu, Duanqing, Tao, Dacheng
Text-to-image generation, i.e., generating an image given a text description, is a very challenging task due to the significant semantic gap between the two domains. Humans, however, tackle this problem intelligently. We learn from diverse objects to form a solid prior about semantics, textures, colors, shapes, and layouts. Given a text description, we immediately imagine an overall visual impression from this prior and, based on this, we draw a picture by progressively adding more and more details. In this paper, inspired by this process, we propose a novel text-to-image method called LeicaGAN that combines these three phases (learning, imagining, and creating) in a unified framework.
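The abstract's progressive drawing idea, sketching a coarse visual impression and then refining it, maps naturally onto a cascade of generators at increasing resolutions. The PyTorch sketch below is a minimal, hypothetical illustration of that coarse-to-fine structure; the class name, dimensions, stage count, and layer choices are assumptions for illustration, not LeicaGAN's actual architecture, and the word-level attention used in such cascades is omitted for brevity.

```python
import torch
import torch.nn as nn

class CascadedGenerator(nn.Module):
    """Hypothetical coarse-to-fine cascade: each stage upsamples the
    previous feature map and refines it into a higher-resolution image."""

    def __init__(self, text_dim=256, z_dim=100, base_ch=64, n_stages=3):
        super().__init__()
        self.base_ch = base_ch
        # "Imagine": project text embedding + noise into a coarse 4x4 impression.
        self.fc = nn.Linear(text_dim + z_dim, base_ch * 4 * 4)
        # "Create": each stage doubles resolution and refines the features.
        self.stages = nn.ModuleList(
            nn.Sequential(
                nn.Upsample(scale_factor=2, mode="nearest"),
                nn.Conv2d(base_ch, base_ch, kernel_size=3, padding=1),
                nn.BatchNorm2d(base_ch),
                nn.ReLU(inplace=True),
            )
            for _ in range(n_stages)
        )
        # One RGB head per stage, so every stage emits an image.
        self.to_rgb = nn.ModuleList(
            nn.Conv2d(base_ch, 3, kernel_size=3, padding=1)
            for _ in range(n_stages)
        )

    def forward(self, text_emb, z):
        h = self.fc(torch.cat([text_emb, z], dim=1))
        h = h.view(-1, self.base_ch, 4, 4)  # coarse overall impression
        images = []
        for stage, head in zip(self.stages, self.to_rgb):
            h = stage(h)  # progressively add finer details
            images.append(torch.tanh(head(h)))
        return images  # e.g. 8x8, 16x16, 32x32 outputs


# Example: a batch of 4 captions encoded to 256-d embeddings.
g = CascadedGenerator()
outs = g(torch.randn(4, 256), torch.randn(4, 100))
print([tuple(o.shape) for o in outs])  # [(4,3,8,8), (4,3,16,16), (4,3,32,32)]
```

Each stage doubles the spatial resolution and emits its own image, so earlier outputs capture the rough impression while later ones add detail.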
Reviews: Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
Quality: The paper is thorough in describing the method and in supporting it with experiments.
Clarity: The paper is well written and easy to follow.
Originality & Significance: Although the method is not very novel in light of Paper 1253: RecreateGAN (see more below), the experimental exploration of different settings of the method is thorough and interesting. The idea of matching local image features to word-level embeddings and global image features to sentence-level embeddings is intuitive and makes sense. This paper shares significant parts of its method with Paper 1253: RecreateGAN; in particular, the textual-visual embedding loss in Eq. (6) of this paper matches the pairwise loss defined in Eq. (5) of the other paper. However, this paper uses this component as part of a different method, namely for a textual-visual embedding rather than an image-similarity embedding. Additionally, the cascade of attentional generators is very similar between the two papers.
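For readers unfamiliar with the pairwise loss the review refers to, the sketch below shows one common way to implement a bidirectional textual-visual matching loss over a batch, where matched image-text pairs attract and all other pairs in the batch serve as negatives. The function name, the gamma smoothing factor, and the batch-negative construction are assumptions for illustration; this is not necessarily the exact Eq. (6) of the paper.

```python
import torch
import torch.nn.functional as F

def pairwise_matching_loss(img_feats, txt_feats, gamma=10.0):
    """Bidirectional matching loss: row i of each tensor is a matched
    image-text pair; every other row in the batch acts as a negative.

    img_feats: (B, D) global image features (or pooled local features)
    txt_feats: (B, D) sentence embeddings (or pooled word embeddings)
    gamma:     smoothing factor sharpening the softmax (illustrative value)
    """
    img = F.normalize(img_feats, dim=-1)
    txt = F.normalize(txt_feats, dim=-1)
    sim = gamma * img @ txt.t()  # (B, B) cosine-similarity scores
    labels = torch.arange(sim.size(0), device=sim.device)
    # Posterior of the matching text given each image, and vice versa.
    loss_i2t = F.cross_entropy(sim, labels)
    loss_t2i = F.cross_entropy(sim.t(), labels)
    return loss_i2t + loss_t2i


# Example: a batch of 8 image/sentence feature pairs of dimension 256.
loss = pairwise_matching_loss(torch.randn(8, 256), torch.randn(8, 256))
```

The same recipe applies at the word level by pooling word-to-region similarities into a single score per image-sentence pair before the softmax.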