Meta-Learning with Self-Improving Momentum Target

Oct-10-2024, 10:54:46 GMT–Neural Information Processing Systems

The idea of using a separately trained target model (or teacher) to improve the performance of a student model has become increasingly popular in various machine learning domains, and meta-learning is no exception: a recent finding shows that utilizing task-wise target models can significantly boost generalization performance. However, obtaining a target model for each task can be highly expensive, especially when the number of tasks for meta-learning is large. To tackle this issue, we propose a simple yet effective method, coined Self-improving Momentum Target (SiMT). SiMT generates the target model by adapting from the temporal ensemble of the meta-learner, i.e., the momentum network. This momentum network and its task-specific adaptations enjoy favorable generalization performance, enabling the meta-learner to improve itself through knowledge distillation.
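
The two ingredients named in the abstract, a momentum network kept as a temporal ensemble of the meta-learner and a knowledge-distillation signal from a target model, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the exponential-moving-average form of the update, the `decay` value, the KL-divergence form of the loss, the `temperature` parameter, and the function names `ema_update` and `distillation_loss` are not taken from the abstract. The task-specific adaptation of the momentum network into a target model is omitted.

```python
import math


def ema_update(momentum_params, learner_params, decay=0.99):
    """Blend the meta-learner's parameters into the momentum network.

    Assumption: the temporal ensemble is an exponential moving average;
    `decay` is a hypothetical hyperparameter, not a value from the abstract.
    """
    return [decay * m + (1.0 - decay) * p
            for m, p in zip(momentum_params, learner_params)]


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, target_logits, temperature=1.0):
    """KL(target || student) between softened class probabilities.

    Assumption: a standard KL-based distillation loss stands in for
    whatever loss the method actually uses.
    """
    p = softmax(target_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


# Toy usage: one momentum step and one distillation loss evaluation.
momentum = ema_update([0.0, 1.0], [1.0, 1.0], decay=0.9)
loss = distillation_loss([2.0, 0.5, 0.1], [1.5, 0.7, 0.2])
```

In this sketch the momentum parameters change slowly relative to the meta-learner, and the loss is zero only when the student and target predictions agree.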

meta-learning, self-improving momentum target, target model
