Transformers on Markov data: Constant depth suffices