QBB: Quantization with Binary Bases for LLMs Adrian Bulat 1,2 Samsung AI Cambridge 2

Open in new window