Supplemental Material for What Neural Networks Memorize and Why A Proof of Lemma 2.1

Neural Information Processing Systems 

We now compute the expected squared error of each of the terms of this estimator. In both cases the squared error is at most 1 / 4 . We implement our algorithms with Tensorflow [1]. Our implementation achieves 73% top-1 accuracy when trained on the full training set. For DenseNet, we halved the batch size and learning rate due to higher memory load of the architecture.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found