Decomposed Knowledge Distillation for Class-Incremental Semantic Segmentation Supplement

Neural Information Processing Systems 

All numbers are obtained by averaging results over five runs with standard deviations in parenthesis. All numbers are also obtained by averaging results over five runs with standard deviations. We have empirically set α to 5 for all experiments. We can also see from Table S6 that our method is robust to various choices of β .