Evidence of Phase Transitions in Small Transformer-Based Language Models