Transformer Approximations from ReLUs