Explaining The Efficacy of Counterfactually-Augmented Data