How much do contextualized representations encode long-range context?