FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models, Ryan Po

May-29-2025, 03:43:11 GMT–Neural Information Processing Systems

Recent advances in text-to-image generation have enabled the creation of highquality images with diverse applications. However, accurately describing desired visual attributes can be challenging, especially for non-experts in art and photography. An intuitive solution involves adopting favorable attributes from source images. Current methods attempt to distill identity and style from source images. However, "style" is a broad concept that includes texture, color, and artistic elements, but does not cover other important attributes like lighting and dynamics.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

May-29-2025, 03:43:11 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.28)
- Europe (0.28)
- North America > United States
  - Hawaii (0.14)

Genre:
- Research Report > New Finding (0.93)

Industry:
- Government (0.67)
- Law (0.67)
- Media > Photography (0.88)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks (0.51)
    - Natural Language > Large Language Model (0.47)
    - Vision (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)