Supplemental Material for What Neural Networks Memorize and Why A Proof of Lemma 2.1
–Neural Information Processing Systems
We now compute the expected squared error of each of the terms of this estimator. In both cases the squared error is at most 1 / 4 . We implement our algorithms with Tensorflow [1]. Our implementation achieves 73% top-1 accuracy when trained on the full training set. For DenseNet, we halved the batch size and learning rate due to higher memory load of the architecture.
Neural Information Processing Systems
Nov-13-2025, 11:53:25 GMT
- Technology: