Multimodal Latent Language Modeling with Next-Token Diffusion

Open in new window