ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis
