Classifier-guided Gradient Modulation for Enhanced Multimodal Learning