Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space

Open in new window