PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression
–Neural Information Processing Systems
There has been significant interest in "extreme" compression of large language models (LLMs), i.e., to 1-2 bits per parameter, which allows such models to be executed efficiently on resource-constrained devices.
Neural Information Processing Systems
Oct-9-2025, 18:00:38 GMT
- Country:
- Asia
- Middle East
- Saudi Arabia (0.04)
- UAE (0.04)
- Russia (0.04)
- Middle East
- Europe
- Austria (0.04)
- Germany > Baden-Württemberg
- Stuttgart Region > Stuttgart (0.04)
- Italy > Tuscany
- Florence (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Asia
- Genre:
- Research Report
- Experimental Study (0.92)
- New Finding (1.00)
- Research Report
- Industry:
- Education (0.45)
- Information Technology (0.45)
- Technology: