Iterative T eacher-A ware Learning Supplementary Material

Neural Information Processing Systems 

First we provide an intuition for the assumption. Next, we start the proof. We need the following lemma: Lemma 1. Now we prove the main theorem. We used two types of loss functions in all the experiment.