Assessing Robustness via Score-Based Adversarial Image Generation