Library Liberation: Competitive Performance Matmul Through Compiler-composed Nanokernels

Open in new window