Language Models as Hierarchy Encoders