The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

Chris Köcher
RPTU Kaiserslautern-Landau