Linguistic Binding in Diffusion Models: Enhancing Attribute Correspondence through Attention Map Alignment

Neural Information Processing Systems 

Text-conditioned image generation models often generate incorrect associations between entities and their visual attributes.
