Primer: Searching for Efficient Transformers for Language Modeling
Neural Information Processing Systems
We identify an architecture, named Primer, that has a smaller training cost than the original Transformer and other variants for auto-regressive language modeling.
Feb-8-2026, 02:37:12 GMT
- Genre:
- Research Report > New Finding (0.46)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (0.48)
- Natural Language > Chatbot (0.50)
- Representation & Reasoning > Search (0.48)