PTQ4DiT: Post-training Quantization for Diffusion Transformers

Neural Information Processing Systems 

We discover two primary quantization challenges inherent in DiTs, notably the presence of salient channels with extreme magnitudes and the temporal variability in distributions of salient activation over multiple timesteps.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found