Distillation Can Make AI Models Smaller and Cheaper
A fundamental technique lets researchers use a big, expensive model to train another model for less.
Earlier this year, the Chinese AI company DeepSeek released a chatbot called R1, which drew a huge amount of attention. Most of that attention focused on the fact that a relatively small and unknown company said it had built a chatbot that rivaled the performance of those from the world's most famous AI companies while using a fraction of the computing power and cost. As a result, the stocks of many Western tech companies plummeted; Nvidia, which sells the chips that run leading AI models, lost more stock value in a single day than any company in history. Some of that attention involved an element of accusation.
Sep-20-2025, 11:00:00 GMT
- Country:
- Africa (0.05)
- Asia
- China (0.05)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.05)
- Europe
- North America
- Central America (0.05)
- United States
- California > San Francisco County
- San Francisco (0.05)
- Pennsylvania (0.05)
- South America (0.05)
- Genre:
- Research Report (0.70)
- Industry:
- Information Technology (1.00)
- Technology: