DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

Neural Information Processing Systems 

Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs and demonstrate the capacity to learn and represent complex visual-semantic relationships. Despite this success, the mechanisms underlying their performance are not yet fully accounted for: open questions remain about what they learn, how they represent visual-semantic relationships, and why they sometimes fail to generalize.
