Leveraging Variation Theory in Counterfactual Data Augmentation for Optimized Active Learning