Review for NeurIPS paper: Stochastic Normalization

Feb-5-2025, 06:19:51 GMT–Neural Information Processing Systems

Summary and Contributions: This paper introduces a novel method to prevent overfitting when fine-tuning a pre-trained network on a new task with a small training set. It proposes a hybrid batch normalization layer, called stochastic normalization, that randomly switches the normalization statistics between those calculated from the current mini-batch and the moving-average statistics. The authors replace the standard batch normalization layers of several network architectures, such as VGG-16, Inception-V3, and ResNet-50, with the proposed stochastic normalization and show empirically that fine-tuning the adapted architectures outperforms multiple existing methods for mitigating overfitting during fine-tuning. Overall, the paper studies a very important problem, and the proposed method seems to work in practice. The major problem I have with this paper is the lack of consistency in the experimental setup.
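To make the switching mechanism described in the summary concrete, here is a minimal pure-Python sketch. It is an illustration only, not the authors' implementation: the function name `stochastic_norm`, the per-channel selection, the selection probability `p`, the momentum update, and the choice to update the moving averages after normalizing are all assumptions, and the learnable scale and shift of a full batch normalization layer are omitted.

```python
import random
from statistics import fmean, pvariance


def stochastic_norm(batch, running_mean, running_var, p=0.5,
                    momentum=0.1, eps=1e-5, training=True, rng=random):
    """Normalize a batch given as a list of samples (each a list of channel values).

    Assumption: during training, each channel independently uses the
    moving-average statistics with probability p and the mini-batch
    statistics otherwise. At inference, moving averages are always used.
    Returns (normalized_batch, new_running_mean, new_running_var).
    """
    n_channels = len(batch[0])
    columns = [[sample[c] for sample in batch] for c in range(n_channels)]
    batch_mean = [fmean(col) for col in columns]
    batch_var = [pvariance(col) for col in columns]  # biased (population) variance

    # Pick the statistics used for normalization, channel by channel.
    mean, var = [], []
    for c in range(n_channels):
        use_moving = (not training) or rng.random() < p
        mean.append(running_mean[c] if use_moving else batch_mean[c])
        var.append(running_var[c] if use_moving else batch_var[c])

    normalized = [
        [(sample[c] - mean[c]) / (var[c] + eps) ** 0.5 for c in range(n_channels)]
        for sample in batch
    ]

    # Update the moving averages only in training mode (assumed to happen
    # after normalization, using the current mini-batch statistics).
    if training:
        running_mean = [(1 - momentum) * r + momentum * b
                        for r, b in zip(running_mean, batch_mean)]
        running_var = [(1 - momentum) * r + momentum * b
                       for r, b in zip(running_var, batch_var)]
    return normalized, running_mean, running_var
```

Under these assumptions, `p=0` reduces to ordinary batch normalization in training mode, while `p=1` always normalizes with the moving averages, as at inference time.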

network backbone, rebuttal, stochastic normalization

Genre:
- Research Report > Promising Solution (0.38)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.94)