Transformers Can Learn Posterior Predictive Distributions In-Context
