FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models, Ryan Po
–Neural Information Processing Systems
Recent advances in text-to-image generation have enabled the creation of highquality images with diverse applications. However, accurately describing desired visual attributes can be challenging, especially for non-experts in art and photography. An intuitive solution involves adopting favorable attributes from source images. Current methods attempt to distill identity and style from source images. However, "style" is a broad concept that includes texture, color, and artistic elements, but does not cover other important attributes like lighting and dynamics.
Neural Information Processing Systems
May-29-2025, 03:43:11 GMT
- Country:
- Asia > China (0.28)
- Europe (0.28)
- North America > United States
- Hawaii (0.14)
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Government (0.67)
- Law (0.67)
- Media > Photography (0.88)
- Technology: