UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

Jan-19-2025, 12:40:21 GMT–Neural Information Processing Systems

Achieving machine autonomy and human control often represent divergent objectives in the design of interactive AI systems. Visual generative foundation models such as Stable Diffusion show promise in navigating these goals, especially when prompted with arbitrary languages. However, they often fall short in generating images with spatial, structural, or geometric controls. The integration of such controls, which can accommodate various visual conditions in a single unified model, remains an unaddressed challenge. In response, we introduce UniControl, a new generative foundation model that consolidates a wide array of controllable condition-to-image (C2I) tasks within a singular framework, while still allowing for arbitrary language prompts.

controllable visual generation, unicontrol, unified diffusion model, (2 more...)

Neural Information Processing Systems

Jan-19-2025, 12:40:21 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.41)