Supplementary Material for " Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect " Kaihua T ang

Neural Information Processing Systems 

It's worth noting that although there are non-linear activation layers in