Scalable Language Models with Posterior Inference of Latent Thought Vectors

Open in new window