Gramian Multimodal Representation Learning and Alignment