A Flexible Instruction Set Architecture for Efficient GEMMs