On the Power of Convolution Augmented Transformer