A Separable Self-attention Inspired by the State Space Model for Computer Vision