Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion