A Theory of Multimodal Learning