Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

Zhu, Wanrong, Wang, Xinyi, Lu, Yujie, Fu, Tsu-Jui, Wang, Xin Eric, Eckstein, Miguel, Wang, William Yang

Oct-28-2023–arXiv.org Artificial Intelligence

The field of text-to-image (T2I) generation has garnered significant attention both within the research community and among everyday users. Despite the advancements of T2I models, a common issue encountered by users is the need for repetitive editing of input prompts in order to receive a satisfactory image, which is time-consuming and labor-intensive. Given the demonstrated text generation power of large-scale language models, such as GPT-k, we investigate the potential of utilizing such models to improve the prompt editing process for T2I generation. We conduct a series of experiments to compare the common edits made by humans and GPT-k, evaluate the performance of GPT-k in prompting T2I, and examine factors that may influence this process. We found that GPT-k models focus more on inserting modifiers while humans tend to replace words and phrases, which includes changes to the subject matter. Experimental results show that GPT-k are more effective in adjusting modifiers rather than predicting spontaneous changes in the primary subject matters. Adopting the edit suggested by GPT-k models may reduce the percentage of remaining edits by 20-30%.

editing trace, gpt-k model, similarity, (11 more...)

arXiv.org Artificial Intelligence

Oct-28-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States > California (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Asia > Japan
  - Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)

Genre:
- Research Report > New Finding (0.88)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.50)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found