SOLE: Hardware-Software Co-design of Softmax and LayerNorm for Efficient Transformer Inference