Spatial Ensemble: a Novel Model Smoothing Mechanism for Student-Teacher Framework
–Neural Information Processing Systems
Model smoothing is of central importance for obtaining a reliable teacher model in the student-teacher framework, where the teacher generates surrogate supervision signals to train the student. A popular model smoothing method is the Temporal Moving Average (TMA), which continuously averages the teacher parameters with the up-to-date student parameters. In this paper, we propose ''Spatial Ensemble'', a novel model smoothing mechanism in parallel with TMA. Spatial Ensemble randomly picks up a small fragment of the student model to directly replace the corresponding fragment of the teacher model.
Neural Information Processing Systems
Dec-24-2025, 09:57:28 GMT