Modeling Transformers as complex networks to analyze learning dynamics