FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models

May-26-2025, 21:27:36 GMT–Neural Information Processing Systems

Recent advances in text-to-image generation have enabled the creation of high-quality images with diverse applications. However, accurately describing desired visual attributes can be challenging, especially for non-experts in art and photography. An intuitive solution involves adopting favorable attributes from source images. Current methods attempt to distill identity and style from source images. However, "style" is a broad concept that includes texture, color, and artistic elements, but does not cover other important attributes like lighting and dynamics.

artificial intelligence, fine-grained visual attribute dataset, machine learning, (4 more...)

Neural Information Processing Systems

May-26-2025, 21:27:36 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)