Auto-Regressive Next-Token Predictors are Universal Learners

Open in new window