Multi-Token Prediction Needs Registers
