SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

Open in new window