Goto

Collaborating Authors

 miniimagenet



Meta-learning with an Adaptive Task Scheduler

Neural Information Processing Systems

To benefit the learning of a new task, meta-learning has been proposed to transfer a well-generalized meta-model learned from various meta-training tasks. Existing meta-learning algorithms randomly sample meta-training tasks with a uniform probability, under the assumption that tasks are of equal importance. However, it is likely that tasks are detrimental with noise or imbalanced given a limited number of meta-training tasks. To prevent the meta-model from being corrupted by such detrimental tasks or dominated by tasks in the majority, in this paper, we propose an adaptive task scheduler (ATS) for the meta-training process. In ATS, for the first time, we design a neural scheduler to decide which meta-training tasks to use next by predicting the probability being sampled for each candidate task, and train the scheduler to optimize the generalization capacity of the metamodel to unseen tasks. We identify two meta-model-related factors as the input of the neural scheduler, which characterize the difficulty of a candidate task to the meta-model. Theoretically, we show that a scheduler taking the two factors into account improves the meta-training loss and also the optimization landscape. Under the setting of meta-learning with noise and limited budgets, ATS improves the performance on both miniImageNet and a real-world drug discovery benchmark by up to 13%and 18%, respectively, compared to state-of-the-art task schedulers.


ASelf Supervised Learning Methods

Neural Information Processing Systems

L.1 Source Dataset: ImageNet Table 13 and Table 14 describe 5-way 1-shot and 5-way 5-shot CD-FSL performance when ImageNet is used as the source dataset, respectively. Note that Table 14 is added for convenience and this is the same with Table 3 in the main paper.





ASelf Supervised Learning Methods

Neural Information Processing Systems

Weusedtheentireimagesthatthe CUBdataset has (train, val, andtest). For example, onthe CUBdataset, theperformancegain (fork =5) is 0.249, 1.035, and 2.276for miniImageNet, tieredImageNet, and ImageNet, respectively.