Leveraging Inter-Layer Dependency for Post-Training Quantization
–Neural Information Processing Systems
Prior works on Post-training Quantization (PTQ) typically separate a neural network into sub-nets and quantize them sequentially.
Neural Information Processing Systems
Oct-3-2025, 05:53:33 GMT