Conditional Prompt Tuning for Multimodal Fusion