1 Experiment Details

Neural Information Processing Systems 

Clearly, with only 26M learnable parameters, the performance can be boosted from 79.9 to 84.4