Large Language Models: A Mathematical Formulation