Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics