Quantifying Context Mixing in Transformers