Language Models as Ontology Encoders
Yang, Hui, Chen, Jiaoyan, He, Yuan, Gao, Yongsheng, Horrocks, Ian
–arXiv.org Artificial Intelligence
OWL (Web Ontology Language) ontologies which are able to formally represent complex knowledge and support semantic reasoning have been widely adopted across various domains such as healthcare and bioinformatics. Recently, ontology embeddings have gained wide attention due to its potential to infer plausible new knowledge and approximate complex reasoning. However, existing methods face notable limitations: geometric model-based embeddings typically overlook valuable textual information, resulting in suboptimal performance, while the approaches that incorporate text, which are often based on language models, fail to preserve the logical structure. In this work, we propose a new ontology embedding method OnT, which tunes a Pretrained Language Model (PLM) via geometric modeling in a hyperbolic space for effectively incorporating textual labels and simultaneously preserving class hierarchies and other logical relationships of Description Logic EL. Extensive experiments on four real-world ontologies show that OnT consistently outperforms the baselines including the state-of-the-art across both tasks of prediction and inference of axioms. OnT also demonstrates strong potential in real-world applications, indicated by its robust transfer learning abilities and effectiveness in real cases of constructing a new ontology from SNOMED CT. Data and code are available at https://github.com/HuiYang1997/OnT.
arXiv.org Artificial Intelligence
Jul-22-2025
- Country:
- Asia > Singapore (0.04)
- Europe
- Greece > Attica
- Athens (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- United Kingdom
- England > Oxfordshire
- Oxford (0.04)
- Scotland > City of Edinburgh
- Edinburgh (0.04)
- England > Oxfordshire
- Greece > Attica
- North America
- Canada > British Columbia
- United States > California
- Los Angeles County > Long Beach (0.04)
- Santa Clara County > Palo Alto (0.04)
- Genre:
- Research Report (0.82)
- Industry:
- Health & Medicine (0.87)
- Technology: