AITopics

Neural Information Processing SystemsJun-12-2026, 04:19:59 GMT

KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment

Maintaining comprehensive and up-to-date knowledge graphs (KGs) is critical for modern AI systems, but manual curation struggles to scale with the rapid growth of scientific literature. This paper presents KARMA, a novel framework employing multi-agent large language models (LLMs) to automate KG enrichment through structured analysis of unstructured text. Our approach employs nine collaborative agents, spanning entity discovery, relation extraction, schema alignment, and conflict resolution that iteratively parse documents, verify extracted knowledge, and integrate it into existing graph structures while adhering to domain-specific schema. Experiments on 1,200 PubMed articles from three different domains demonstrate the effectiveness of KARMA in knowledge graph enrichment, with the identification of up to 38,230 new entities while achieving 83.1\% LLM-verified correctness and reducing conflict edges by 18.6\% through multi-layer assessments.

large language model, natural language, proceedings, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.65)

Neural Information Processing SystemsApr-27-2026, 01:21:52 GMT

49d1cf22327c51331cbd52bcb76a09a6-Supplemental-Conference.pdf

ConceptNet488 comprises commonly observed entities and their connections, where edge weights signify the re-489 liability and frequency of these relationships. To prevent the redundancy of common information and to maintain the validity of the enriched491 relations, we categorized the relationships based on their weights. Relationships with weights less492 than 1 were deemed "weak" and those with a weight of 1 were labeled "average". We refrained from493 using these categories for relation enhancement. Instead, only relationships with weights greater than494 1, indicative of high reliability, were employed for augmenting the relations.495

artificial intelligence, knowledge, natural language, (15 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.47)

Neural Information Processing SystemsFeb-11-2026, 09:45:14 GMT

49d1cf22327c51331cbd52bcb76a09a6-Paper-Conference.pdf

knowledge, openvik, visual knowledge, (13 more...)

Country:

Europe (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (0.68)
Transportation (0.46)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Donated Christmas trees get a second life at the zoo

The evergreen trees give kangaroos, bison, lions, and more extra shelter and fun. Capybaras use donated Christmas trees as wind breaks to protect their habitats. Breakthroughs, discoveries, and DIY tips sent every weekday. The presents are unwrapped, the cookies are crumbs, and that real Christmas tree will become a fire hazard soon enough. Most of us haul it out to the curb for our local sanitation departments to take care of, but some lucky trees make it into the paws of animals living in zoos.

christmas tree, enrichment, zoo, (14 more...)

Popular Science

Country:

North America > United States > New Jersey > Cape May County (0.05)
Europe > United Kingdom > Scotland (0.05)

Industry:

Health & Medicine (0.99)
Leisure & Entertainment > Zoo & Circus (0.86)

Technology: Information Technology > Artificial Intelligence (0.36)

arXiv.org Artificial IntelligenceOct-22-2025

A Multi-Evidence Framework Rescues Low-Power Prognostic Signals and Rejects Statistical Artifacts in Cancer Genomics

Akarlar, Gokturk Aytug

Motivation: Standard genome-wide association studies in cancer genomics rely on statistical significance with multiple testing correction, but systematically fail in underpowered cohorts. In TCGA breast cancer (n=967, 133 deaths), low event rates (13.8%) create severe power limitations, producing false negatives for known drivers and false positives for large passenger genes. Results: We developed a five-criteria computational framework integrating causal inference (inverse probability weighting, doubly robust estimation) with orthogonal biological validation (expression, mutation patterns, literature evidence). Applied to TCGA-BRCA mortality analysis, standard Cox+FDR detected zero genes at FDR<0.05, confirming complete failure in underpowered settings. Our framework correctly identified RYR2 -- a cardiac gene with no cancer function -- as a false positive despite nominal significance (p=0.024), while identifying KMT2C as a complex candidate requiring validation despite marginal significance (p=0.047, q=0.954). Power analysis revealed median power of 15.1% across genes, with KMT2C achieving only 29.8% power (HR=1.55), explaining borderline statistical significance despite strong biological evidence. The framework distinguished true signals from artifacts through mutation pattern analysis: RYR2 showed 29.8% silent mutations (passenger signature) with no hotspots, while KMT2C showed 6.7% silent mutations with 31.4% truncating variants (driver signature). This multi-evidence approach provides a template for analyzing underpowered cohorts, prioritizing biological interpretability over purely statistical significance. Availability: All code and analysis pipelines available at github.com/akarlaraytu/causal-inference-for-cancer-genomics

artificial intelligence, machine learning, mutation, (18 more...)

2510.18571

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.91)

Neural Information Processing SystemsOct-8-2025, 15:15:21 GMT

Open Visual Knowledge Extraction via Relation-Oriented Multimodality Model Prompting

Existing methods on visual knowledge extraction often rely on the predefined format (e.g., sub-verb-obj tuples) or vocabulary (e.g., relation types), restricting the expressiveness of the extracted knowledge. In this work, we take a first exploration to a new paradigm of open visual knowledge extraction.

data mining, large language model, machine learning, (19 more...)

Country:

Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Health & Medicine (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(4 more...)

Asgari-Bidhendi, Majid, Ghaseminia, Muhammad Amin, Shahbazi, Alireza, Hossayni, Sayyed Ali, Torabian, Najmeh, Minaei-Bidgoli, Behrouz

Rezwan: Leveraging Large Language Models for Comprehensive Hadith Text Processing: A 1.2M Corpus Development

arXiv.org Artificial IntelligenceOct-7-2025

This paper presents the development of Rezwan, a large-scale AI-assisted Hadith corpus comprising over 1.2M narrations, extracted and structured through a fully automated pipeline. Building on digital repositories such as Maktabat Ahl al-Bayt, the pipeline employs Large Language Models (LLMs) for segmentation, chain--text separation, validation, and multi-layer enrichment. Each narration is enhanced with machine translation into twelve languages, intelligent diacritization, abstractive summarization, thematic tagging, and cross-text semantic analysis. This multi-step process transforms raw text into a richly annotated research-ready infrastructure for digital humanities and Islamic studies. A rigorous evaluation was conducted on 1,213 randomly sampled narrations, assessed by six domain experts. Results show near-human accuracy in structured tasks such as chain--text separation (9.33/10) and summarization (9.33/10), while highlighting ongoing challenges in diacritization and semantic similarity detection. Comparative analysis against the manually curated Noor Corpus demonstrates the superiority of Najm in both scale and quality, with a mean overall score of 8.46/10 versus 3.66/10. Furthermore, cost analysis confirms the economic feasibility of the AI approach: tasks requiring over 229,000 hours of expert labor were completed within months at a fraction of the cost. The work introduces a new paradigm in religious text processing by showing how AI can augment human expertise, enabling large-scale, multilingual, and semantically enriched access to Islamic heritage.

artificial intelligence, large language model, natural language, (19 more...)

2510.03781

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Pozzi, Riccardo, Barbera, Valentina, Principe, Renzo Alva, Giardini, Davide, Rubini, Riccardo, Palmonari, Matteo

Combining Knowledge Graphs and NLP to Analyze Instant Messaging Data in Criminal Investigations

arXiv.org Artificial IntelligenceOct-1-2025

Criminal investigations often involve the analysis of messages exchanged through instant messaging apps such as WhatsApp, which can be an extremely effort-consuming task. Our approach integrates knowledge graphs and NLP models to support this analysis by semantically enriching data collected from suspects' mobile phones, and help prosecutors and investigators search into the data and get valuable insights. Our semantic enrichment process involves extracting message data and modeling it using a knowledge graph, generating transcriptions of voice messages, and annotating the data using an end-to-end entity extraction approach. We adopt two different solutions to help users get insights into the data, one based on querying and visualizing the graph, and one based on semantic search. The proposed approach ensures that users can verify the information by accessing the original data. While we report about early results and prototypes developed in the context of an ongoing project, our proposal has undergone practical applications with real investigation data. As a consequence, we had the chance to interact closely with prosecutors, collecting positive feedback but also identifying interesting opportunities as well as promising research directions to share with the research community.

information retrieval, machine learning, natural language, (23 more...)

doi: 10.1007/978-981-96-0567-5_30

2509.26487

Country:

Europe > Italy (0.28)
Europe > Austria (0.28)

Genre: Research Report (0.82)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Law (1.00)
Information Technology (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.93)
(2 more...)

Gliozzo, Alfio, Khan, Naweed, Constantinides, Christodoulos, Mihindukulasooriya, Nandana, Defosse, Nahuel, Rossiello, Gaetano, Lee, Junkyu

Transduction is All You Need for Structured Data Workflows

arXiv.org Artificial IntelligenceSep-30-2025

This paper introduces Agentics, a functional agentic AI framework for building LLM-based structured data workflow pipelines. Designed for both research and practical applications, Agentics offers a new data-centric paradigm in which agents are embedded within data types, enabling logical transduction between structured states. This design shifts the focus toward principled data modeling, providing a declarative language where data types are directly exposed to large language models and composed through transductions triggered by type connections. We present a range of structured data workflow tasks and empirical evidence demonstrating the effectiveness of this approach, including data wrangling, text-to-SQL semantic parsing, and domain-specific multiple-choice question answering. The open source Agentics is available at https://github.com/IBM/Agentics.

large language model, machine learning, natural language, (23 more...)

2508.1561

Genre: Workflow (1.00)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)