Manipulating Embeddings of Stable Diffusion Prompts
Deckers, Niklas, Peters, Julia, Potthast, Martin
–arXiv.org Artificial Intelligence
Generative text-to-image models such as Stable Diffusion allow users to generate images based on a textual description, the prompt. Changing the prompt is still the primary means for the user to change a generated image as desired. However, changing the image by reformulating the prompt remains a difficult process of trial and error, which has led to the emergence of prompt engineering as a new field of research. We propose and analyze methods to change the embedding of a prompt directly instead of the prompt text. It allows for more fine-grained and targeted control that takes into account user intentions. Our approach treats the generative text-to-image model as a continuous function and passes gradients between the image space and the prompt embedding space. By addressing different user interaction problems, we can apply this idea in three scenarios: (1) Optimization of a metric defined in image space that could measure, for example, image style. (2) Assistance of users in creative tasks by enabling them to navigate the image space along a selection of directions of "near" prompt embeddings. (3) Changing the embedding of the prompt to include information that the user has seen in a particular seed but finds difficult to describe in the prompt. Our experiments demonstrate the feasibility of the described methods.
arXiv.org Artificial Intelligence
Aug-23-2023
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Maryland > Baltimore (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Los Angeles County > Long Beach (0.04)
- Canada > Ontario
- Toronto (0.04)
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- Italy > Calabria
- Catanzaro Province > Catanzaro (0.04)
- Germany > Saxony
- Leipzig (0.05)
- France > Hauts-de-France
- United Kingdom > England
- Asia > Middle East
- UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- Africa > Rwanda
- North America
- Genre:
- Research Report (0.64)
- Technology: