LiteTransformerSearch: Training-free Neural Architecture Search for Efficient Language Models
–Neural Information Processing Systems
The Transformer architecture is ubiquitously used as the building block of large-scale autoregressive language models.
Neural Information Processing Systems
Aug-17-2025, 04:55:42 GMT
- Country:
- North America > United States > California > San Diego County > San Diego (0.04)
- Genre:
- Research Report > New Finding (0.93)
- Technology: