Slagle - 2024 - SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Kevin Slagle

Neural Information Processing Systems 

In this work, we study the performance of byte-level and subword-level autoregressive models when trained using a fixed compute budget.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found