Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation: Supplementary Material
Bingchen Zhao
Neural Information Processing Systems
The initial learning rate is set to 0.1 for all datasets except ImageNet-1K and is decayed by a factor of 10 at the 170th epoch. We also carry out experiments using "hard" and "soft" cosine similarity. For the "hard" variant, we apply a threshold (0.9 in our experiments) to the similarity score to obtain binary pseudo labels. For the "soft" variant, we use the score directly as a soft pseudo label. The results are presented in Table 3.
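The two pseudo-labelling variants and the step learning-rate schedule can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`cosine_similarity`, `pairwise_pseudo_labels`, `step_lr`) are hypothetical, the strict `>` comparison against the threshold is an assumption, and so is treating the 170th epoch as index 170 with zero-based counting. Only the values 0.1, 10, 170 and 0.9 come from the text.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pairwise_pseudo_labels(features, mode="hard", threshold=0.9):
    """Build an n x n matrix of pairwise pseudo labels.

    mode="hard": 1.0 if the score exceeds the threshold, else 0.0
                 (strict '>' is an assumption).
    mode="soft": the raw cosine score is used as the label; no clipping
                 is applied here, so negative scores pass through.
    """
    if mode not in ("hard", "soft"):
        raise ValueError("mode must be 'hard' or 'soft'")
    n = len(features)
    labels = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            score = cosine_similarity(features[i], features[j])
            if mode == "hard":
                labels[i][j] = 1.0 if score > threshold else 0.0
            else:
                labels[i][j] = score
    return labels


def step_lr(epoch, base_lr=0.1, decay_epoch=170, factor=10.0):
    """Learning rate for a given epoch: base_lr until decay_epoch,
    then base_lr / factor (zero-based epoch indexing is an assumption)."""
    return base_lr / factor if epoch >= decay_epoch else base_lr


# Toy features: the first two vectors are nearly parallel, the third is orthogonal.
features = [[1.0, 0.0], [0.95, 0.1], [0.0, 1.0]]
hard = pairwise_pseudo_labels(features, mode="hard")
soft = pairwise_pseudo_labels(features, mode="soft")
```

In this toy example the hard variant marks the first two samples as a positive pair and the third as negative to both, while the soft variant keeps the graded score for each pair.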
Nov-15-2025, 14:32:37 GMT