Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators

Open in new window