Vulnerability-Aware Robust Multimodal Adversarial Training