LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models

Neural Information Processing Systems 

The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found