Reviewer
–Neural Information Processing Systems
We thank all reviewers for their insightful comments. We subsequently use this characterization to bound the train and test loss in Lemma F.2. "mnist necessarily less complex than cifar10?" [R8] use spectrally normalized margin distributions to show "...if drop-out or batch-norm have any influence on the obtained results" We will add another section on the effect of batch-norm in the revised version.
Neural Information Processing Systems
Nov-14-2025, 05:12:51 GMT
- Technology: