A Formal description of our method

Neural Information Processing Systems 

In this section we provide extended experimental results that show the student's test accuracy over the The student's test accuracy over the training trajectory using hard-distillation corresponding to the experiments of Figure 4. See Section 3.1.2 Figure 8. See Section 3.1.4 Temperature-scaling, a technique introduced in the original paper of Hinton et. Indeed, it is known (see e.g. The results can be found in the table below.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found