AITopics | mix-of-show

Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the utilization of multiple-concept LoRAs to jointly support multiple customized concepts presents a challenge. We refer to this scenario as decentralized multi-concept customization, which involves single-client concept tuning and center-node concept fusion. In this paper, we propose a new framework called Mix-of-Show that addresses the challenges of decentralized multi-concept customization, including concept conflicts resulting from existing single-client LoRA tuning and identity loss during model fusion. Mix-of-Show adopts an embedding-decomposed LoRA (ED-LoRA) for single-client tuning and gradient fusion for the center node to preserve the in-domain essence of single concepts and support theoretically limitless concept fusion. Additionally, we introduce regionally controllable sampling, which extends spatially controllable sampling (e.g., ControlNet and T2I-Adapter) to address attribute binding and missing object problems in multi-concept sampling. Extensive experiments demonstrate that Mix-of-Show is capable of composing multiple customized concepts with high fidelity, including characters, objects, and scenes.
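As a rough illustration of the setting the abstract describes, the sketch below builds two low-rank (LoRA-style) weight updates and merges them into one base weight with a plain weighted sum. This is the naive baseline, not Mix-of-Show's ED-LoRA or gradient fusion; the function names, matrix sizes, and fusion weights are all hypothetical and chosen only for the example.

```python
import numpy as np


def lora_delta(down: np.ndarray, up: np.ndarray, scale: float = 1.0) -> np.ndarray:
    """Return the low-rank weight update scale * (up @ down)."""
    return scale * (up @ down)


def naive_weighted_fusion(base, deltas, weights):
    """Baseline fusion: add a weighted sum of LoRA updates to the base weight."""
    fused = base.copy()
    for delta, weight in zip(deltas, weights):
        fused += weight * delta
    return fused


rng = np.random.default_rng(0)
d_out, d_in, rank = 8, 6, 2  # illustrative sizes, not taken from the paper

base = rng.standard_normal((d_out, d_in))

# Two hypothetical single-concept LoRAs, each tuned independently of the other.
deltas = [
    lora_delta(rng.standard_normal((rank, d_in)), rng.standard_normal((d_out, rank)))
    for _ in range(2)
]

# Each concept's update is scaled down when merged, so neither is applied at full strength.
fused = naive_weighted_fusion(base, deltas, weights=[0.5, 0.5])
```

With equal weights, each concept contributes only half of its tuned update to the merged weight, which gives some intuition for why a simple merge can dilute individual concepts.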

decentralized low-rank adaptation, mix-of-show, multi-concept customization

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Supplementary Material for 'Mix-of-Show'

Yuchao Gu

Neural Information Processing Systems | Oct-8-2025, 10:13:02 GMT

The object part is borrowed from DreamBooth [1] and Custom Diffusion [2]. We use a 0.01 noise offset for all ... In single-client concept tuning, tuning each concept takes approximately 10-20 minutes on two NVIDIA A100 GPUs, depending on data volume. In Restylization, we explore the concept's ability to adapt to various artistic ... In Interaction, we investigate the concept's capability to interact with other objects, such as ... In Property Modification, we modify the internal state of the concept, including expressions or states like running or jumping. This yields a total of 1000 images for each concept. According to the evaluation setting described in Sec. ...
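The excerpt does not say how the 0.01 noise offset is applied. A minimal sketch, assuming it refers to the commonly used offset-noise trick for diffusion training (a per-sample, per-channel constant added to the Gaussian noise), could look like the following. The function name and tensor shape are hypothetical.

```python
import numpy as np


def offset_noise(shape, offset=0.01, rng=None):
    """Gaussian noise plus a small per-sample, per-channel constant shift.

    Assumes a (batch, channels, height, width) layout.
    """
    rng = np.random.default_rng() if rng is None else rng
    batch, channels, _, _ = shape
    noise = rng.standard_normal(shape)
    # One scalar per (sample, channel), broadcast over height and width.
    shift = rng.standard_normal((batch, channels, 1, 1))
    return noise + offset * shift


# Example: noise for a batch of four 4-channel 64x64 latents (illustrative shape).
noise = offset_noise((4, 4, 64, 64), offset=0.01, rng=np.random.default_rng(0))
```

Setting `offset=0.0` reduces this to ordinary Gaussian noise.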

artificial intelligence, machine learning, mix-of-show

Country: Asia > Singapore (0.04)

Industry: Information Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.69)

3340ee1e4a8bad8d32c35721712b4d0a-Paper-Conference.pdf

Neural Information Processing Systems | Oct-8-2025, 10:13:01 GMT

arxiv preprint arxiv, machine learning, natural language

Country:

Asia > Middle East > Republic of Türkiye > Batman Province > Batman (0.05)
Asia > Singapore (0.04)
North America > United States > Virginia (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)

Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

Neural Information Processing Systems | Oct-11-2024, 01:44:55 GMT

decentralized low-rank adaptation, mix-of-show, multi-concept customization

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models

Kwon, Gihyun, Jenni, Simon, Li, Dingzeyu, Lee, Joon-Young, Ye, Jong Chul, Heilbron, Fabian Caba

arXiv.org Artificial Intelligence | Apr-5-2024

While there has been significant progress in customizing text-to-image generation models, generating images that combine multiple personalized concepts remains challenging. In this work, we introduce Concept Weaver, a method for composing customized text-to-image diffusion models at inference time. Specifically, the method breaks the process into two steps: creating a template image aligned with the semantics of input prompts, and then personalizing the template using a concept fusion strategy. The fusion strategy incorporates the appearance of the target concepts into the template image while retaining its structural details. The results indicate that our method can generate multiple custom concepts with higher identity fidelity compared to alternative approaches. Furthermore, the method is shown to seamlessly handle more than two concepts and closely follow the semantic meaning of the input prompt without blending appearances across different subjects.

concept weaver, custom concept, template image

2404.03913

Genre: Research Report > New Finding (0.46)

Technology: