Pretrained Transformers as Universal Computation Engines
