from reviewers
–Neural Information Processing Systems
We thank the reviewers for the detailed and helpful reviews. Current SSL methods (including MoCo, which we base upon) train only the bottom-up encoder w/o labels. This is fundamentally different than an'intensive data augmentation', as suggested by R3. Comparison with MoCo trained with 50 extra epochs (R2, R4). Everything else is different, e.g., the high-level goal, the dataset (imagenet vs. fine-grained CUB), the loss, Why not share params in f and g (L118)?
Neural Information Processing Systems
Oct-2-2025, 14:22:54 GMT
- Technology: