Deep Compression of Pre-trained Transformer Models

Neural Information Processing Systems 

Owing to their computational efficiency and scalability, transformer models can be trained on exceedingly large amounts of data, at the expense of tremendous growth in model size.