PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile Peiyan Dong 1, Lei Lu
–Neural Information Processing Systems
Model quantization is a widely-used technique to optimize the hardware efficiency of deep neural networks.
Neural Information Processing Systems
Feb-8-2026, 14:45:16 GMT
- Country:
- Europe > Switzerland > Zürich > Zürich (0.14)
- Genre:
- Research Report > Promising Solution (0.46)
- Technology: