Exploring Transformer Backbones for Heterogeneous Treatment Effect Estimation