Accelerating Sparse Ternary GEMM for Quantized ML on Apple Silicon

Open in new window