When does mixup promote local linearity in learned representations?

Open in new window