AITopics | Ontologies

Collaborating Authors

Ontologies

"An ontology defines the terms used to describe and represent an area of knowledge. … Ontologies include computer-usable definitions of basic concepts in the domain and the relationships among them."
– from OWL Web Ontology Language Use Cases and Requirements. W3C Recommendation (10 February 2004). Jeff Heflin, editor.

News Overviews Instructional Materials AI-Alerts Classics

A Survey on Knowledge Organization Systems of Research Fields: Resources and Challenges

Salatino, Angelo, Aggarwal, Tanay, Mannocci, Andrea, Osborne, Francesco, Motta, Enrico

arXiv.org Artificial IntelligenceSep-6-2024

Knowledge Organization Systems (KOSs), such as term lists, thesauri, taxonomies, and ontologies, play a fundamental role in categorising, managing, and retrieving information. In the academic domain, KOSs are often adopted for representing research areas and their relationships, primarily aiming to classify research articles, academic courses, patents, books, scientific venues, domain experts, grants, software, experiment materials, and several other relevant products and agents. These structured representations of research areas, widely embraced by many academic fields, have proven effective in empowering AI-based systems to i) enhance retrievability of relevant documents, ii) enable advanced analytic solutions to quantify the impact of academic research, and iii) analyse and forecast research dynamics. This paper aims to present a comprehensive survey of the current KOS for academic disciplines. We analysed and compared 45 KOSs according to five main dimensions: scope, structure, curation, usage, and links to other KOSs. Our results reveal a very heterogeneous scenario in terms of scope, scale, quality, and usage, highlighting the need for more integrated solutions for representing research knowledge across academic fields. We conclude by discussing the main challenges and the most promising future directions.

koss, ontology, thesaurus, (11 more...)

arXiv.org Artificial Intelligence

2409.04432

Country:

Oceania > New Zealand (0.14)
North America > Canada > Alberta (0.14)
North America > United States > New York > New York County > New York City (0.04)
(19 more...)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Education (1.00)
Government > Regional Government (0.94)
Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
(7 more...)

Add feedback

Medical Concept Normalization in a Low-Resource Setting

Patzelt, Tim

arXiv.org Artificial IntelligenceSep-6-2024

In the field of biomedical natural language processing, medical concept normalization is a crucial task for accurately mapping mentions of concepts to a large knowledge base. However, this task becomes even more challenging in low-resource settings, where limited data and resources are available. In this thesis, I explore the challenges of medical concept normalization in a low-resource setting. Specifically, I investigate the shortcomings of current medical concept normalization methods applied to German lay texts. Since there is no suitable dataset available, a dataset consisting of posts from a German medical online forum is annotated with concepts from the Unified Medical Language System. The experiments demonstrate that multilingual Transformer-based models are able to outperform string similarity methods. The use of contextual information to improve the normalization of lay mentions is also examined, but led to inferior results. Based on the results of the best performing model, I present a systematic error analysis and lay out potential improvements to mitigate frequent errors.

dataset, information, representation, (17 more...)

arXiv.org Artificial Intelligence

2409.14579

Country: Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Promising Solution (0.67)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.94)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.87)

Add feedback

Redefining Data-Centric Design: A New Approach with a Domain Model and Core Data Ontology for Computational Systems

Johnson, William, Davis, James, Kelly, Tara

arXiv.org Artificial IntelligenceSep-1-2024

Before this, fragmented computer networks struggled to communicate seamlessly. The introduction of the Transmission Control Protocol/Internet Protocol (TCP/IP) enabled consistent data transfer and became the standard for digital communication. However, this node-centric approach, which relies heavily on Internet Protocol (IP) addresses, has also created significant security vulnerabilities and privacy concerns due to its focus on network nodes rather than the data itself. In today's digital landscape, the centralized aggregation and storage of sensitive user data -- including IP addresses -- by service providers pose substantial security risks. These centralized repositories are prime targets for cyberattacks, potentially compromising user privacy and exposing sensitive information. Additionally, the reliance on IP-based system modeling has amplified these risks, necessitating a shift toward a more secure and resilient design approach. This paper proposes a novel data-centric design methodology that moves away from traditional node-focused models. By prioritizing data as the central entity and incorporating multimodal frameworks encompassing objects, events, concepts, and actions, this approach enhances data security and flexibility. The new informatics domain model reimagines data's role in system design, emphasizing its importance throughout its entire lifecycle to foster innovation, security, and seamless data interoperability.

application example, data-centric model, ontology, (9 more...)

arXiv.org Artificial Intelligence

2409.09058

Genre: Research Report (0.40)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (0.34)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

Building FKG.in: a Knowledge Graph for Indian Food

Gupta, Saransh Kumar, Dey, Lipika, Das, Partha Pratim, Jain, Ramesh

arXiv.org Artificial IntelligenceSep-1-2024

This paper presents an ontology design along with knowledge engineering, and multilingual semantic reasoning techniques to build an automated system for assimilating culinary information for Indian food in the form of a knowledge graph. The main focus is on designing intelligent methods to derive ontology designs and capture all-encompassing knowledge about food, recipes, ingredients, cooking characteristics, and most importantly, nutrition, at scale. We present our ongoing work in this workshop paper, describe in some detail the relevant challenges in curating knowledge of Indian food, and propose our high-level ontology design. We also present a novel workflow that uses AI, LLM, and language technology to curate information from recipe blog sites in the public domain to build knowledge graphs for Indian food. The methods for knowledge curation proposed in this paper are generic and can be replicated for any domain. The design is application-agnostic and can be used for AI-driven smart analysis, building recommendation systems for Personalized Digital Health, and complementing the knowledge graph for Indian food with contextual information such as user information, food biochemistry, geographic information, agricultural information, etc.

information, ingredient, knowledge graph, (15 more...)

arXiv.org Artificial Intelligence

2409.0083

Country:

North America > United States > New York (0.04)
Europe > Spain (0.04)
Europe > Portugal (0.04)
(7 more...)

Genre: Research Report (0.40)

Industry:

Health & Medicine > Consumer Health (1.00)
Education > Health & Safety > School Nutrition (0.94)
Consumer Products & Services (0.94)
Food & Agriculture > Agriculture (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)

Add feedback

Hierarchical Blockmodelling for Knowledge Graphs

Pietrasik, Marcin, Reformat, Marek, Wilbik, Anna

arXiv.org Artificial IntelligenceAug-28-2024

In this paper, we investigate the use of probabilistic graphical models, specifically stochastic blockmodels, for the purpose of hierarchical entity clustering on knowledge graphs. These models, seldom used in the Semantic Web community, decompose a graph into a set of probability distributions. The parameters of these distributions are then inferred allowing for their subsequent sampling to generate a random graph. In a non-parametric setting, this allows for the induction of hierarchical clusterings without prior constraints on the hierarchy's structure. Specifically, this is achieved by the integration of the Nested Chinese Restaurant Process and the Stick Breaking Process into the generative model. In this regard, we propose a model leveraging such integration and derive a collapsed Gibbs sampling scheme for its inference. To aid in understanding, we describe the steps in this derivation and provide an implementation for the sampler. We evaluate our model on synthetic and real-world datasets and quantitatively compare against benchmark models. We further evaluate our results qualitatively and find that our model is capable of inducing coherent cluster hierarchies in small scale settings. The work presented in this paper provides the first step for the further application of stochastic blockmodels for knowledge graphs on a larger scale. We conclude the paper with potential avenues for future work on more scalable inference schemes.

hierarchy, knowledge graph, pqr, (11 more...)

arXiv.org Artificial Intelligence

2408.15649

Country:

North America > United States (0.28)
Pacific Ocean (0.04)
Europe > Netherlands > Limburg > Maastricht (0.04)
(28 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Government (0.67)
Consumer Products & Services > Restaurants (0.48)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
(2 more...)

Add feedback

Tripl\`etoile: Extraction of Knowledge from Microblogging Text

Zavarella, Vanni, Consoli, Sergio, Recupero, Diego Reforgiato, Fenu, Gianni, Angioni, Simone, Buscaldi, Davide, Dessì, Danilo, Osborne, Francesco

arXiv.org Artificial IntelligenceAug-27-2024

Numerous methods and pipelines have recently emerged for the automatic extraction of knowledge graphs from documents such as scientific publications and patents. However, adapting these methods to incorporate alternative text sources like micro-blogging posts and news has proven challenging as they struggle to model open-domain entities and relations, typically found in these sources. In this paper, we propose an enhanced information extraction pipeline tailored to the extraction of a knowledge graph comprising open-domain entities from micro-blogging posts on social media platforms. Our pipeline leverages dependency parsing and classifies entity relations in an unsupervised manner through hierarchical clustering over word embeddings. We provide a use case on extracting semantic triples from a corpus of 100 thousand tweets about digital transformation and publicly release the generated knowledge graph. On the same dataset, we conduct two experimental evaluations, showing that the system produces triples with precision over 95% and outperforms similar pipelines of around 5% in terms of precision, while generating a comparatively higher number of triples.

knowledge graph, pipeline, relation, (13 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.heliyon.2024.e32479

2408.14908

Country:

Europe > Switzerland (0.04)
Europe > Italy > Sardinia > Cagliari (0.04)
South America > Brazil > Rio Grande do Sul > Porto Alegre (0.04)
(10 more...)

Genre: Research Report (1.00)

Industry:

Banking & Finance (1.00)
Health & Medicine (0.93)
Media > News (0.86)
(2 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
(8 more...)

Add feedback

Relationships are Complicated! An Analysis of Relationships Between Datasets on the Web

Lin, Kate, Alrashed, Tarfah, Noy, Natasha

arXiv.org Artificial IntelligenceAug-26-2024

The Web today has millions of datasets, and the number of datasets continues to grow at a rapid pace. These datasets are not standalone entities; rather, they are intricately connected through complex relationships. Semantic relationships between datasets provide critical insights for research and decision-making processes. In this paper, we study dataset relationships from the perspective of users who discover, use, and share datasets on the Web: what relationships are important for different tasks? What contextual information might users want to know? We first present a comprehensive taxonomy of relationships between datasets on the Web and map these relationships to user tasks performed during dataset discovery. We develop a series of methods to identify these relationships and compare their performance on a large corpus of datasets generated from Web pages with schema.org markup. We demonstrate that machine-learning based methods that use dataset metadata achieve multi-class classification accuracy of 90%. Finally, we highlight gaps in available semantic markup for datasets and discuss how incorporating comprehensive semantics can facilitate the identification of dataset relationships. By providing a comprehensive overview of dataset relationships at scale, this paper sets a benchmark for future research.

dataset, dataset relationship, metadata, (16 more...)

arXiv.org Artificial Intelligence

2408.14636

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California (0.04)

Genre:

Research Report (0.50)
Overview (0.48)

Industry: Government > Regional Government > North America Government > United States Government (0.67)

Technology:

Information Technology > Communications > Web (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Add feedback

KGPrune: a Web Application to Extract Subgraphs of Interest from Wikidata with Analogical Pruning

Monnin, Pierre, Nousradine, Cherif-Hassan, Jarnac, Lucas, Zuckerman, Laurel, Couceiro, Miguel

arXiv.org Artificial IntelligenceAug-26-2024

Knowledge graphs (KGs) have become ubiquitous publicly available knowledge sources, and are nowadays covering an ever increasing array of domains. However, not all knowledge represented is useful or pertaining when considering a new application or specific task. Also, due to their increasing size, handling large KGs in their entirety entails scalability issues. These two aspects asks for efficient methods to extract subgraphs of interest from existing KGs. To this aim, we introduce KGPrune, a Web Application that, given seed entities of interest and properties to traverse, extracts their neighboring subgraphs from Wikidata. To avoid topical drift, KGPrune relies on a frugal pruning algorithm based on analogical reasoning to only keep relevant neighbors while pruning irrelevant ones. The interest of KGPrune is illustrated by two concrete applications, namely, bootstrapping an enterprise KG and extracting knowledge related to looted artworks.

kgp rune, seed entity, wikidata, (12 more...)

arXiv.org Artificial Intelligence

2408.14658

Country:

Europe > Portugal > Lisbon > Lisbon (0.04)
Europe > Greece (0.04)
North America > United States > New York > New York County > New York City (0.04)
(11 more...)

Genre: Research Report (0.50)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.47)

Add feedback

DSTI at LLMs4OL 2024 Task A: Intrinsic versus extrinsic knowledge for type classification

Akl, Hanna Abi

arXiv.org Artificial IntelligenceAug-26-2024

We introduce semantic towers, an extrinsic knowledge representation method, and compare it to intrinsic knowledge in large language models for ontology learning. Our experiments show a trade-off between performance and semantic grounding for extrinsic knowledge compared to a fine-tuned model intrinsic knowledge. We report our findings on the Large Language Models for Ontology Learning (LLMs4OL) 2024 challenge.

category, llms4ol 2024, semantic tower, (11 more...)

arXiv.org Artificial Intelligence

2408.14236

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Germany > Berlin (0.04)
Europe > France > Île-de-France > Paris > Paris (0.04)
Europe > France > Provence-Alpes-Côte d'Azur (0.04)

Genre: Research Report > New Finding (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.78)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.77)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.64)

Add feedback

Has Multimodal Learning Delivered Universal Intelligence in Healthcare? A Comprehensive Survey

Lin, Qika, Zhu, Yifan, Mei, Xin, Huang, Ling, Ma, Jingying, He, Kai, Peng, Zhen, Cambria, Erik, Feng, Mengling

arXiv.org Artificial IntelligenceAug-23-2024

The rapid development of artificial intelligence has constantly reshaped the field of intelligent healthcare and medicine. As a vital technology, multimodal learning has increasingly garnered interest due to data complementarity, comprehensive modeling form, and great application potential. Currently, numerous researchers are dedicating their attention to this field, conducting extensive studies and constructing abundant intelligent systems. Naturally, an open question arises that has multimodal learning delivered universal intelligence in healthcare? To answer the question, we adopt three unique viewpoints for a holistic analysis. Firstly, we conduct a comprehensive survey of the current progress of medical multimodal learning from the perspectives of datasets, task-oriented methods, and universal foundation models. Based on them, we further discuss the proposed question from five issues to explore the real impacts of advanced techniques in healthcare, from data and technologies to performance and ethics. The answer is that current technologies have NOT achieved universal intelligence and there remains a significant journey to undertake. Finally, in light of the above reviews and discussions, we point out ten potential directions for exploration towards the goal of universal intelligence in healthcare.

arxiv preprint arxiv, dataset, representation, (15 more...)

arXiv.org Artificial Intelligence

2408.1288

Country:

North America > United States (0.14)
Asia > China > Shaanxi Province > Xi'an (0.04)
Asia > Singapore (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (1.00)
(2 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(8 more...)

Add feedback