Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning