OneBit: Towards Extremely Low-bit Large Language Models

Neural Information Processing Systems 

LLMs difficult beyond mid-to-high-end GPUs like the A100, let alone on mobile devices. The high demand for resources not only drives up usage costs, but also restricts their wider application.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found