Shapeshifter: a Parameter-efficient Transformer using Factorized Reshaped Matrices

Neural Information Processing Systems 

Language models employ a very large number of trainable parameters.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found