LLark: A Multimodal Foundation Model for Music