Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings Appendix
–Neural Information Processing Systems
We provide hyper-parameters of our models in Table A.1. Table A.1: Hyper-parameters used for training our VisualCSE and AudioCSE. Vision, we use Dropout augmentation (the same strategy in SimCSE) for AudioCSE. We compare unsup-SimCSE and unsup-VisualCSE on a small scale retrieval test. As shown in Table C.1, VisualCSE generally retrieves qualitatively different sentences than SimCSE.
Neural Information Processing Systems
Feb-12-2026, 13:16:03 GMT