Knowledge Distillation by On-the-Fly Native Ensemble

Xu Lan, Xiatian Zhu, Shaogang Gong

Neural Information Processing Systems 

Knowledge distillation is effective for training small, generalisable network models that meet low-memory and fast-execution requirements. Existing offline distillation methods rely on a strong pre-trained teacher, which enables favourable knowledge discovery and transfer but requires a complex two-phase training procedure. Online counterparts address this limitation at the price of lacking a high-capacity teacher. In this work, we present an On-the-fly Native Ensemble (ONE) learning strategy for one-stage online distillation.
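The following is a minimal PyTorch sketch of the ONE idea, not the authors' implementation: a single multi-branch network whose gated combination of branch logits serves as an on-the-fly ensemble teacher, distilled back into each branch within the same training pass. The trunk/branch architectures, branch count, temperature, and the decision to stop gradients through the teacher are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ONENet(nn.Module):
    """One multi-branch network: a shared trunk, m peer branches, and a gate
    that fuses branch logits into an on-the-fly ensemble teacher."""

    def __init__(self, num_branches=3, num_classes=10):
        super().__init__()
        # Shared low-level layers (illustrative; ONE shares the early stages
        # of a standard CNN such as a ResNet).
        self.trunk = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.BatchNorm2d(32), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4),
        )
        # Peer branches: separate copies of the high-level layers + classifier.
        self.branches = nn.ModuleList(
            nn.Sequential(nn.Flatten(), nn.Linear(32 * 4 * 4, num_classes))
            for _ in range(num_branches)
        )
        # Gate: per-sample weights over branches, computed from shared features.
        self.gate = nn.Linear(32 * 4 * 4, num_branches)

    def forward(self, x):
        h = self.trunk(x)                                            # shared features
        flat = h.flatten(1)
        logits = torch.stack([b(h) for b in self.branches], dim=1)   # (B, m, C)
        w = F.softmax(self.gate(flat), dim=1)                        # (B, m)
        teacher = (w.unsqueeze(-1) * logits).sum(dim=1)              # gated ensemble
        return logits, teacher

def one_loss(logits, teacher, targets, T=3.0):
    """Cross-entropy on every branch and on the ensemble, plus a distillation
    term pushing each branch toward the ensemble teacher (temperature T)."""
    m = logits.size(1)
    ce = F.cross_entropy(teacher, targets)
    ce = ce + sum(F.cross_entropy(logits[:, i], targets) for i in range(m))
    # Detaching the teacher here is an assumption of this sketch; the key
    # point is that the ensemble's knowledge is distilled back into branches.
    soft_t = F.softmax(teacher.detach() / T, dim=1)
    kd = sum(
        F.kl_div(F.log_softmax(logits[:, i] / T, dim=1), soft_t,
                 reduction="batchmean") * (T * T)
        for i in range(m)
    )
    return ce + kd

# Usage: teacher and students are built and trained in one forward/backward
# pass, so no separate pre-training phase is needed.
model = ONENet()
x, y = torch.randn(8, 3, 32, 32), torch.randint(0, 10, (8,))
logits, teacher = model(x)
loss = one_loss(logits, teacher, y)
loss.backward()
```

Because the teacher is assembled from the branches themselves and updated in the same backward pass, training remains one-stage, which is the point of the "on-the-fly native ensemble" construction.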
