Review for NeurIPS paper: Labelling unlabelled videos from scratch with multi-modal self-supervision
–Neural Information Processing Systems
Weaknesses: Required clarifications. Some parts of the work need clarification, see below:
* The exact algorithm is not described clearly enough in the paper or the appendix. I understand that code is provided, but the procedure should be clear from the paper itself. In particular, is it a purely alternating approach? How many examples are sampled for the clustering stage (is N equal to the number of examples in the dataset)? If I understand correctly, thanks to the probabilistic formulation, there is no need to reinitialize the last linear layer once the data is reclustered. Is that correct?
  - If not, it is unclear to me how to apply the algorithm in an online fashion (see later for a related question).
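To make these questions concrete, here is a minimal sketch of what a purely alternating scheme could look like. It is an illustration under stated assumptions, not the authors' implementation: a Sinkhorn-style balanced assignment stands in for the clustering stage, a single linear softmax head stands in for the network, the clustering stage uses all N examples, and every name (`alternate`, `sinkhorn_labels`, `train_step`, `predict`) is hypothetical.

```python
import math
import random


def softmax(row):
    # Numerically stable softmax over one row of logits.
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def predict(W, X):
    # Linear head followed by softmax: one probability row per example.
    return [softmax([sum(w * x for w, x in zip(wk, xi)) for wk in W]) for xi in X]


def sinkhorn_labels(P, n_iters=50):
    # ASSUMPTION: balanced pseudo-labels via Sinkhorn-style rescaling.
    # Columns (clusters) and rows (examples) are normalized in turn so that
    # each cluster receives roughly equal mass before taking the argmax.
    N, K = len(P), len(P[0])
    Q = [row[:] for row in P]
    for _ in range(n_iters):
        col = [sum(Q[i][k] for i in range(N)) for k in range(K)]
        Q = [[Q[i][k] / (col[k] * K) for k in range(K)] for i in range(N)]
        Q = [[q / (sum(row) * N) for q in row] for row in Q]
    return [max(range(K), key=lambda k: Q[i][k]) for i in range(N)]


def train_step(W, X, y, lr=0.5):
    # One full-batch gradient step of cross-entropy on the pseudo-labels.
    P = predict(W, X)  # probabilities under the weights before this step
    N = len(X)
    for i, (xi, yi) in enumerate(zip(X, y)):
        for k in range(len(W)):
            g = P[i][k] - (1.0 if k == yi else 0.0)
            for d in range(len(xi)):
                W[k][d] -= lr * g * xi[d] / N
    return W


def alternate(X, K, n_rounds=5, steps_per_round=20, seed=0):
    rng = random.Random(seed)
    W = [[rng.gauss(0, 0.1) for _ in X[0]] for _ in range(K)]
    for _ in range(n_rounds):
        # Clustering stage: relabel all N examples from the current head.
        labels = sinkhorn_labels(predict(W, X))
        # Training stage: the head W is kept, NOT reinitialized (assumption).
        for _ in range(steps_per_round):
            train_step(W, X, labels)
    return W, sinkhorn_labels(predict(W, X))
```

In this sketch the pseudo-labels are derived from the head's own outputs, so cluster indices stay aligned with the head across rounds and no reinitialization is needed. Whether the paper's algorithm behaves this way is exactly what the authors are asked to clarify.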
Jan-23-2025, 05:22:42 GMT