Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data

Open in new window