Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation

Neural Information Processing Systems 

Diffusion, providing a refined and efficient way of aligning pose representation during image synthesis. We leverage the query-key self-attention mechanism of ViTs to explore the interconnections among different anatomical parts in human pose skeletons.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found