High Throughput Matrix-Matrix Multiplication between Asymmetric Bit-Width Operands