AlleviatingtheSampleSelectionBiasinFew-shot LearningbyRemovingProjectiontotheCentroid

Neural Information Processing Systems 

While agood feature extractor may help cluster unseen data, thetask distribution shift between training andtesting [25] still makes it hard to estimate novel class distribution using a small number of samples from the support set. Thus, the performance is strongly correlated with the sample quality of the support data.