Supplementary Materials: FiV A: Fine-grained Visual Attribute Dataset for T ext-to-Image Diffusion Models
–Neural Information Processing Systems
Section A. We then introduce additional details on dataset construction in Section B. Further, we Finally, we discuss the limitations and future work of the project in Section D. Please also find the Details on attribute taxonomy and statistics. We visualize the rough distribution of visual attributes and subjects on the left. We also visualize the attribute alignment accuracy via human validation here. Due to space limitations, only 15 sub-subjects are listed for each major-subject. The result shows that Image 4 exhibits inconsistencies, with the reasons provided.
Neural Information Processing Systems
Oct-9-2025, 23:32:10 GMT