LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Neural Information Processing Systems 

However, such inputs impose a substantial burden on users when compared to simple text inputs. To address the issue, we study how Large Language Models (LLMs) can serve as visual planners by generating layouts from text conditions, and thus collaborate with visual generative models. We propose LayoutGPT, a method to compose in-context visual demonstrations in style sheet language to enhance the visual planning skills of LLMs.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found