MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

Mao, Qi, Chen, Lan, Gu, Yuchao, Fang, Zhen, Shou, Mike Zheng

Dec-21-2023–arXiv.org Artificial Intelligence

However, localized editing in complex the other hand, mask-free methods that utilize attention injection scenarios has not been well-studied in the literature, despite mechanisms such as Prompt-to-Prompt (P2P) [10] its growing real-world demands. Existing mask-based and Plug-and-Play (PnP) [28] can preserve the original image's inpainting methods fall short of retaining the underlying structure and layout. Nevertheless, they struggle to structure within the edit region. Meanwhile, mask-free precisely align the local editing region with the intended attention-based methods often exhibit editing leakage and text in intricate scenarios, largely due to their reliance on the misalignment in more complex compositions. In this work, text prompts' localization capabilities. As a result, editing we develop MAG-Edit, a training-free, inference-stage optimization effects often extend beyond the intended area and impact method, which enables localized image editing in incorrect regions, as shown in the fourth column of Figure 1.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Dec-21-2023

arXiv.org PDF

Add feedback

Country:
- Asia
  - Singapore (0.04)
  - China (0.04)

Genre:
- Research Report (0.64)

Industry:
- Media > Photography (0.63)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.93)
  - Vision (0.68)
  - Representation & Reasoning (0.68)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)