A Survey of Techniques for Optimizing Transformer Inference