Position-based Scaled Gradient for Model Quantization and Pruning - Appendix

Neural Information Processing Systems 

In this experiment, we quantize only the weights, not the activations, to compare the performance degradation as the weight bit-width decreases. We also report the mean squared error (MSE) of the weights at each bit-width. Each column header shows the layer name, with its number of parameters in parentheses. All numbers are results from the last epoch.

Table A3: ResNet-32 trained with Adam on the CIFAR-100 dataset.
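The weight-only quantization and weight MSE measurement described above can be illustrated with a minimal sketch. It assumes a symmetric uniform quantizer and takes the MSE between the full-precision and quantized weights; the text here does not specify the exact quantizer, so `quantize_uniform`, `weight_mse`, and the synthetic Gaussian weights are hypothetical stand-ins rather than the authors' implementation.

```python
import numpy as np


def quantize_uniform(w, bits):
    """Symmetric uniform quantization (an assumed scheme, not the paper's).

    Maps each weight to the nearest of 2 * (2**(bits - 1) - 1) + 1 evenly
    spaced levels spanning [-max|w|, +max|w|].
    """
    levels = 2 ** (bits - 1) - 1          # e.g. 7 positive levels for 4 bits
    max_abs = np.max(np.abs(w))
    if levels == 0 or max_abs == 0:
        return np.zeros_like(w)
    scale = max_abs / levels              # step size between adjacent levels
    q = np.clip(np.round(w / scale), -levels, levels)
    return q * scale


def weight_mse(w, bits):
    """MSE between full-precision weights and their quantized version."""
    return float(np.mean((w - quantize_uniform(w, bits)) ** 2))


# Hypothetical stand-in for one layer's weights (not real ResNet-32 weights).
rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.05, size=10_000)

# Activations are untouched; only the weight tensor is quantized.
mse_by_bits = {bits: weight_mse(w, bits) for bits in (8, 6, 4, 2)}
for bits, mse in mse_by_bits.items():
    print(f"{bits}-bit weights: MSE = {mse:.3e}")
```

On these synthetic weights the MSE grows as the bit-width shrinks, which is the quantity the table tracks per layer alongside accuracy.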
