Language Models as Hierarchy Encoders

May-26-2025, 17:41:55 GMT–Neural Information Processing Systems

Current language models (LMs) are limited in their ability to interpret the hierarchical structures latent in language. While previous research has implicitly leveraged these hierarchies to enhance LMs, approaches for encoding them explicitly have yet to be explored. To address this, we introduce a novel approach that re-trains transformer encoder-based LMs as Hierarchy Transformer encoders (HiTs), harnessing the expansive nature of hyperbolic space. Our method situates the output embedding space of a pre-trained LM within a Poincaré ball whose curvature adapts to the embedding dimension, and then re-trains the model with hyperbolic clustering and centripetal losses. These losses are designed to cluster related entities (input as texts) and organise them hierarchically.
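The abstract does not spell out the loss formulas, so the following is only a minimal sketch of how such losses could look, not the authors' implementation. It assumes a Poincaré ball of curvature -c with c = 1/d for embedding dimension d (one possible reading of "adapts to the embedding dimension"), a triplet-style clustering loss, and a centripetal loss that pushes parents closer to the origin than their children. The function names, margin values, and example vectors are all hypothetical.

```python
import math

def poincare_distance(u, v, c):
    """Geodesic distance in a Poincare ball of curvature -c (c > 0)."""
    sq_u = sum(x * x for x in u)
    sq_v = sum(x * x for x in v)
    sq_diff = sum((a - b) ** 2 for a, b in zip(u, v))
    denom = (1 - c * sq_u) * (1 - c * sq_v)
    return math.acosh(1 + 2 * c * sq_diff / denom) / math.sqrt(c)

def hyperbolic_norm(u, c):
    """Hyperbolic distance from the origin of the ball to u."""
    return poincare_distance([0.0] * len(u), u, c)

def clustering_loss(child, parent, negative, c, margin=1.0):
    """Assumed triplet form: a child should be nearer its parent than a negative."""
    gap = poincare_distance(child, parent, c) - poincare_distance(child, negative, c)
    return max(gap + margin, 0.0)

def centripetal_loss(child, parent, c, margin=0.1):
    """Assumed form: a parent should lie nearer the origin than its child."""
    gap = hyperbolic_norm(parent, c) - hyperbolic_norm(child, c)
    return max(gap + margin, 0.0)

# Toy 2-dimensional example; the curvature choice c = 1/d is an assumption.
d = 2
c = 1.0 / d
parent = [0.1, 0.0]     # hypothetical embedding of a general entity
child = [0.6, 0.1]      # hypothetical embedding of a more specific entity
negative = [-0.8, 0.5]  # hypothetical embedding of an unrelated entity

print(clustering_loss(child, parent, negative, c))
print(centripetal_loss(child, parent, c))
```

In this toy setup both losses are zero because the child already sits closer to its parent than to the negative and the parent already sits nearer the origin. Swapping `child` and `parent` in `centripetal_loss` yields a positive penalty.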

artificial intelligence, hierarchy encoder, natural language

Genre:
- Research Report (0.44)

Technology:
- Information Technology > Artificial Intelligence > Natural Language (0.65)