When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD