Advancing Direct Convolution using Convolution Slicing Optimization and ISA Extensions