Long-Short Transformer: Efficient Transformers for Language and Vision (Appendix) A Details of Norm Comparisons