From Bytes to Ideas: Language Modeling with Autoregressive U-Nets
