Uncovering Uncertainty in Transformer Inference