On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability

Open in new window