Looking Beyond Single Images for Contrastive Semantic Segmentation Learning Supplementary Material

Neural Information Processing Systems 

We report the proxy mean IoU (PmIoU) of the auxiliary labels on the validation set together with the mIoU after pre-training and fine-tuning. The first condition (#1) establishes a baseline using only intra-image contrast without any auxiliary labels. Positive correspondences are generated by matching pixels across different augmentations of the same image. Here, positive correspondences are established between pixels originating from the same superpixel. The remaining conditions vary the feature extractors, the clustering algorithm, and the use of superpixels.