Style Aligned Image Generation via Shared Attention

Hertz, Amir, Voynov, Andrey, Fruchter, Shlomi, Cohen-Or, Daniel

Jan-11-2024–arXiv.org Artificial Intelligence

Large-scale Text-to-Image (T2I) models have rapidly gained prominence across creative fields, generating visually compelling outputs from textual prompts. However, controlling these models to ensure consistent style remains challenging: existing methods require fine-tuning and manual intervention to disentangle content and style. In this paper, we introduce StyleAligned, a novel technique designed to establish style alignment among a series of generated images. By employing minimal 'attention sharing' during the diffusion process, our method maintains style consistency across images within T2I models. This approach also allows for the creation of style-consistent images from a reference style image through a straightforward inversion operation. Evaluation of our method across diverse styles and text prompts demonstrates high-quality synthesis and fidelity, underscoring its efficacy in achieving consistent style across various inputs.
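
The abstract does not spell out how 'attention sharing' works, so the following is only a minimal sketch of one plausible reading: within a batch of images being generated together, each image's attention queries see the keys and values of a designated reference image in addition to its own. The function names `attention` and `shared_attention`, the `ref_index` parameter, and the list-of-vectors data layout are all hypothetical and chosen for illustration; any further details of the published method are deliberately left out.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Plain scaled dot-product attention over lists of vectors."""
    scale = 1.0 / math.sqrt(len(queries[0]))
    out = []
    for q in queries:
        # Similarity of this query to every key.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out.append([
            sum(w * v[d] for w, v in zip(weights, values))
            for d in range(len(values[0]))
        ])
    return out


def shared_attention(batch, ref_index=0):
    """Assumed sharing scheme (illustrative, not the authors' code).

    batch: list of (queries, keys, values) tuples, one per image.
    The reference image attends only to itself; every other image
    attends to the reference keys/values concatenated with its own.
    """
    _, ref_k, ref_v = batch[ref_index]
    outputs = []
    for i, (q, k, v) in enumerate(batch):
        if i == ref_index:
            outputs.append(attention(q, k, v))
        else:
            outputs.append(attention(q, ref_k + k, ref_v + v))
    return outputs


if __name__ == "__main__":
    # Toy example: one token per image, two feature dimensions.
    reference = ([[1.0, 0.0]], [[1.0, 0.0]], [[10.0, 10.0]])
    target = ([[0.0, 1.0]], [[0.0, 1.0]], [[0.0, 0.0]])
    print(shared_attention([reference, target]))
```

In this toy setup the target image's output is pulled part of the way toward the reference image's values, which is the intuition behind using shared attention to propagate style; the reference image itself is left unchanged.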

diffusion model, reference image, stylealigned

Country:
- North America > United States (0.04)
- South America > Brazil
  - Rio de Janeiro > Rio de Janeiro (0.04)
- Pacific Ocean > North Pacific Ocean
  - San Francisco Bay > Golden Gate (0.04)
- Europe
  - Romania > Sud - Muntenia Development Region
    - Giurgiu County > Giurgiu (0.04)
  - Germany > Bavaria
    - Upper Bavaria > Munich (0.04)
- Asia > Middle East
  - Saudi Arabia > Northern Borders Province
    - Arar (0.04)
  - Israel > Tel Aviv District
    - Tel Aviv (0.04)

Genre:
- Research Report (1.00)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (1.00)
  - Artificial Intelligence
    - Vision (1.00)
    - Natural Language (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.93)