Convolutional Self-Attention Networks