XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiTModulation
–Neural Information Processing Systems
Achieving fine-grained control over subject identity and semantic attributes (pose, style, lighting) in text-to-image generation, particularly for multiple subjects, often undermines the editability and coherence of Diffusion Transformers (DiTs).
Neural Information Processing Systems
Jun-14-2026, 14:37:52 GMT
- Genre:
- Research Report
- Experimental Study (1.00)
- Promising Solution (0.67)
- Research Report
- Industry:
- Information Technology (0.46)
- Technology: