multi-modal knowledge graph
Water flow in prairie watersheds is increasingly unpredictable -- but AI could help
In recent years, the Prairies have seen bigger swings in climate conditions -- very wet years followed by very dry ones. That makes an already unpredictable landscape even harder to forecast, with real consequences for flood preparedness and water quality. The challenge is the landscape itself. Much of the Canadian Prairies sit within the Prairie Pothole Region, a landscape dotted with millions of shallow wetlands and depressions. Water doesn't simply run downhill into a stream, it is stored first.
2026 AI Index Report released
The ninth edition of the Artificial Intelligence Index Report was published on 13 April 2026. Released on a yearly basis, the aim of the document is to provide readers with accurate, rigorously validated, and globally-sourced data to give insights into the progress of AI and its potential impact on society. The 2026 AI Index Report comprises nine chapters, covering: research and development, technical performance, responsible AI, economy, science, medicine, education, policy and governance, and public opinion. AI capability is accelerating and reaching more people than ever. Model performance continues to improve against benchmarks, and 80% of university students now use generative AI.
'Probably' doesn't mean the same thing to your AI as it does to you
'Probably' doesn't mean the same thing to your AI as it does to you When a human says an event is "probable" or "likely," people generally have a shared, if fuzzy, understanding of what that means. But when an AI chatbot like ChatGPT uses the same word, it's not assessing the odds the way we do, my colleagues and I found. We recently published a study in the journal NPJ Complexity that suggests that, while large language model AIs excel at conversation, they often fail to align with humans when communicating uncertainty . The research focused on words of estimative probability, which include terms like "maybe," "probably" and "almost certain." By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models.
A model for defect identification in materials
In biology, defects are generally bad. But in materials science, defects can be intentionally tuned to give materials useful new properties. Today, atomic-scale defects are carefully introduced during the manufacturing process of products like steel, semiconductors, and solar cells to help improve strength, control electrical conductivity, optimize performance, and more. But even as defects have become a powerful tool, accurately measuring different types of defects and their concentrations in finished products has been challenging, especially without cutting open or damaging the final material. Without knowing what defects are in their materials, engineers risk making products that perform poorly or have unintended properties.
Causal models for decision systems: an interview with Matteo Ceriscioli
How do you go about integrating causal knowledge into decision systems or agents? We sat down with Matteo Ceriscioli to find out about his research in this space. This interview is the latest in our series featuring the AAAI/SIGAI Doctoral Consortium participants. Could you start by telling us a bit about your PhD - where are you studying, and what's the broad topic of your research? The idea is to integrate causal knowledge into agents or decision systems to make them more reliable.
Identifying interactions at scale for LLMs
Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decision-making process more transparent to model builders and impacted humans, a step toward safer and more trustworthy AI. To achieve state-of-the-art performance, models synthesize complex feature relationships, find shared patterns from diverse training examples, and process information through highly interconnected internal components. In this blog post, we describe the fundamental ideas behind SPEX and ProxySPEX, algorithms capable of identifying these critical interactions at scale. We mask or remove specific segments of the input prompt and measure the resulting shift in the predictions.
Interview with Sukanya Mandal: Synthesizing multi-modal knowledge graphs for smart city intelligence
In their paper LLMasMMKG: LLM Assisted Synthetic Multi-Modal Knowledge Graph Creation For Smart City Cognitive Digital Twins, which was published in the AAAI Fall Symposium series, and introduced an approach that leverages large language models to automate the construction of synthetic multi-modal knowledge graphs specifically designed for a smart city cognitive digital twin. Here, Sukanya tells us more about cognitive digital twins, the framework they employed, and some key results. Could you start by introducing the idea of smart city cognitive digital twins and why this is an interesting area for study? Cities grow increasingly complex and interconnected, demanding sophisticated tools for management. A cognitive digital twin (CDT) serves as an AI-enabled virtual replica that models the dynamic interplay of physical and social systems, enabling simulations, predictions, and optimized operations.
Formal verification for safety evaluation of autonomous vehicles: an interview with Abdelrahman Sayed Sayed
In this interview series, we're meeting some of the AAAI/SIGAI Doctoral Consortium participants to find out more about their research. We sat down with Abdelrahman Sayed Sayed to chat about his work on formal verification applied to autonomous vehicles. Could you tell us a bit about where you're studying and the broad topic of your research? My PhD topic is formal verification of neural ODE (ordinary differential equations) for safety evaluation in autonomous vehicles. Could you say something about formal verification and why it's such an important topic?
Structured and Abstractive Reasoning on Multi-modal Relational Knowledge Images
Zhang, Yichi, Chen, Zhuo, Guo, Lingbing, Liang, Lei, Zhang, Wen, Chen, Huajun
Effectively reasoning about abstractive image inputs poses an elevated challenge for MLLMs, as it demands not only basic object recognition but also a deeper understanding and interpretation of the complex information encapsulated within these human-defined abstractive visual forms. Among the diverse array of abstractive images, an important area remains underexplored: ST ructured and A bstractive R easoning (ST AR) on images with M ulti-M odal R elational K nowledge (MMRK). As illustrated in Figure 1, MMRK consists of multiple multi-modal entities and concepts that are interconnected by abstract relational edges, representing well-organized and structured factual knowledge. Unlike natural or other abstractive images, MMRK offers a flexible and structured format for encoding complex semantic relations, with broad application potential (An et al., 2025). The relational links act as higher-order human-defined abstractions, modeling intricate connections among entities, and thus place greater demands on MLLM's reasoning capabilities. To accurately perform ST AR, MLLMs must understand both the entities and the underlying relational structure. However, ST AR remains largely unaddressed, with only a few studies (Zhang et al., 2024a; 2025d) briefly investigating this capability, which still face two critical challenges: (i) Lack of large-scale data synthesis method for ST AR. From the data perspective, there is a shortage of high-quality MMRK images and corresponding multi-modal instruction data. Automated pipelines for generating diverse and scalable MMRK datasets are missing, along with reliable chain-of-thought (CoT) reasoning annotations needed to improve MLLM's complex thinking and generalization ability.
Pedestrian Attribute Recognition via Hierarchical Cross-Modality HyperGraph Learning
Wang, Xiao, Wu, Shujuan, Cheng, Xiaoxia, Bi, Changwei, Tang, Jin, Luo, Bin
Current Pedestrian Attribute Recognition (PAR) algorithms typically focus on mapping visual features to semantic labels or attempt to enhance learning by fusing visual and attribute information. However, these methods fail to fully exploit attribute knowledge and contextual information for more accurate recognition. Although recent works have started to consider using attribute text as additional input to enhance the association between visual and semantic information, these methods are still in their infancy. To address the above challenges, this paper proposes the construction of a multi-modal knowledge graph, which is utilized to mine the relationships between local visual features and text, as well as the relationships between attributes and extensive visual context samples. Specifically, we propose an effective multi-modal knowledge graph construction method that fully considers the relationships among attributes and the relationships between attributes and vision tokens. To effectively model these relationships, this paper introduces a knowledge graph-guided cross-modal hypergraph learning framework to enhance the standard pedestrian attribute recognition framework. Comprehensive experiments on multiple PAR benchmark datasets have thoroughly demonstrated the effectiveness of our proposed knowledge graph for the PAR task, establishing a strong foundation for knowledge-guided pedestrian attribute recognition. The source code of this paper will be released on https://github.com/Event-AHU/OpenPAR