Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction