A path to natural language through tokenisation and transformers