Spatial interaction



Generating Origin-Destination Matrices in Neural Spatial Interaction Models

Neural Information Processing Systems

Agent-based models (ABMs) are proliferating as decision-making tools across policy areas in transportation, economics, and epidemiology. In these models, a central object of interest is the discrete origin-destination matrix which captures spatial interactions and agent trip counts between locations. Existing approaches resort to continuous approximations of this matrix and subsequent ad-hoc discretisations in order to perform ABM simulation and calibration. This impedes conditioning on partially observed summary statistics, fails to explore the multimodal matrix distribution over a discrete combinatorial support, and incurs discretisation errors. To address these challenges, we introduce a computationally efficient framework that scales linearly with the number of origin-destination pairs, operates directly on the discrete combinatorial space, and learns the agents' trip intensity through a neural differential equation that embeds spatial interactions. Our approach outperforms the prior art in terms of reconstruction error and ground truth matrix coverage, at a fraction of the computational cost. We demonstrate these benefits in two large-scale spatial mobility ABMs in Washington, DC and Cambridge, UK.
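The central object the abstract describes is a discrete origin-destination matrix of integer trip counts. As a minimal sketch of what "operating directly on the discrete combinatorial space" means (this is an illustration, not the paper's method: the function names and the Poisson sampling choice are assumptions), one can draw each entry as a Poisson count from a trip intensity rather than rounding a continuous approximation:

```python
import math
import random

def sample_od_matrix(intensity, rng):
    """Draw a discrete origin-destination matrix with T_ij ~ Poisson(Lambda_ij).

    `intensity` maps (origin, destination) pairs to a non-negative trip
    intensity Lambda_ij; the sampled matrix lives directly on the discrete
    combinatorial support (integer trip counts), with no ad-hoc rounding.
    """
    def poisson(lam):
        # Knuth's algorithm: multiply uniforms until the product
        # drops below exp(-lam); the number of factors is the sample.
        limit, k, p = math.exp(-lam), 0, 1.0
        while True:
            p *= rng.random()
            if p <= limit:
                return k
            k += 1

    return {od: poisson(lam) for od, lam in intensity.items()}

rng = random.Random(0)
intensity = {("A", "B"): 3.0, ("A", "C"): 0.5, ("B", "C"): 7.0}
matrix = sample_od_matrix(intensity, rng)
```

In the paper's framework the intensity itself is learned by a neural differential equation; here it is simply given.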


A Gravity-informed Spatiotemporal Transformer for Human Activity Intensity Prediction

Wang, Yi, Wang, Zhenghong, Zhang, Fan, Kang, Chaogui, Ruan, Sijie, Zhu, Di, Tang, Chengling, Ma, Zhongfu, Zhang, Weiyu, Zheng, Yu, Yu, Philip S., Liu, Yu

arXiv.org Artificial Intelligence

Human activity intensity prediction is crucial to many location-based services. Despite tremendous progress in modeling the dynamics of human activity, most existing methods overlook the physical constraints of spatial interaction, leading to uninterpretable spatial correlations and an over-smoothing phenomenon. To address these limitations, this work proposes a physics-informed deep learning framework, the Gravity-informed Spatiotemporal Transformer (Gravityformer), which integrates the universal law of gravitation to refine transformer attention. Specifically, it (1) estimates two spatially explicit mass parameters from spatiotemporal embedding features, (2) models spatial interaction in an end-to-end neural network using a proposed adaptive gravity model to learn the physical constraint, and (3) utilizes the learned spatial interaction to guide and mitigate the over-smoothing phenomenon in transformer attention. Moreover, a parallel spatiotemporal graph convolution transformer is proposed to balance coupled spatial and temporal learning. Systematic experiments on six real-world large-scale activity datasets demonstrate the quantitative and qualitative superiority of our model over state-of-the-art benchmarks. Additionally, the learned gravity attention matrix can not only be disentangled and interpreted in terms of geographical laws, but also improves generalization in zero-shot cross-region inference. This work provides novel insight into integrating physical laws with deep learning for spatiotemporal prediction. Index Terms: human activity intensity prediction; gravity model; spatial interaction; physics-informed machine learning; over-smoothing phenomenon; spatiotemporal graph neural network. This work is supported by the National Natural Science Foundation of China (Grants 42430106, 42371468, and 424B2013).
Yi Wang, Zhenghong Wang, Fan Zhang, Chengling Tang, Weiyu Zhang, and Yu Liu are with the Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China. Chaogui Kang is with the National Engineering Research Center of Geographic Information System, China University of Geosciences (Wuhan), 430074, China. Sijie Ruan is with the School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China. Di Zhu and Zhongfu Ma are with the Department of Geography, Environment and Society, University of Minnesota, Twin Cities, Minneapolis, MN 55455, USA. Yu Zheng is with JD iCity, JD Technology, Beijing 100176, China. Philip S. Yu is with the Department of Computer Science, University of Illinois Chicago, Chicago, IL 60607, USA.
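The core idea of gravity-refined attention can be sketched compactly. This is a hedged illustration, not Gravityformer's actual architecture: the function names, the fixed `beta` exponent, and the additive-in-log-space modulation are all assumptions; in the paper the mass parameters are learned from spatiotemporal embeddings.

```python
import math

def gravity_weights(mass, coords, beta=1.0):
    """Pairwise gravity weights G_ij = m_i * m_j / d_ij**beta.

    `mass` holds one (here given, in the paper learned) mass per location,
    `coords` the location positions. The diagonal keeps the mass product
    alone to avoid dividing by d_ii = 0.
    """
    n = len(mass)
    g = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                g[i][j] = mass[i] * mass[j]
            else:
                d = math.dist(coords[i], coords[j])
                g[i][j] = mass[i] * mass[j] / d ** beta
    return g

def gravity_refined_attention(scores, g):
    """Modulate raw attention scores with gravity weights (added in log
    space), then re-normalise each row with a softmax so rows sum to one.
    Distant, low-mass pairs get damped, which counteracts over-smoothing."""
    out = []
    for row_s, row_g in zip(scores, g):
        logits = [s + math.log(w) for s, w in zip(row_s, row_g)]
        m = max(logits)
        exps = [math.exp(v - m) for v in logits]
        z = sum(exps)
        out.append([e / z for e in exps])
    return out

att = gravity_refined_attention(
    [[0.0, 0.0], [0.0, 0.0]],
    gravity_weights([1.0, 2.0], [(0.0, 0.0), (3.0, 4.0)]),
)
```

With uniform raw scores, attention now concentrates on nearer, heavier locations rather than spreading evenly.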


OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models

Jia, Mengdi, Qi, Zekun, Zhang, Shaochen, Zhang, Wenyao, Yu, Xinqiang, He, Jiawei, Wang, He, Yi, Li

arXiv.org Artificial Intelligence

Spatial reasoning is a key aspect of cognitive psychology and remains a bottleneck for current vision-language models (VLMs). While extensive research has aimed to evaluate or improve VLMs' understanding of basic spatial relations, such as distinguishing left from right, near from far, and object counting, these tasks cover only the most elementary layer of spatial reasoning and are largely approaching saturation in the latest reasoning models. In this work, we introduce OmniSpatial, a comprehensive and challenging benchmark for spatial reasoning, grounded in cognitive psychology. OmniSpatial covers four major categories: dynamic reasoning, complex spatial logic, spatial interaction, and perspective-taking, with 50 fine-grained subcategories. Through careful manual annotation, we construct over 8.4K question-answer pairs. Extensive experiments show that both open- and closed-source VLMs exhibit significant limitations in comprehensive spatial reasoning. We also explore two strategies, PointGraph (explicit scene graph cues) and SpatialCoT (novel-view chain-of-thought), to bolster spatial reasoning.


ST-GS: Vision-Based 3D Semantic Occupancy Prediction with Spatial-Temporal Gaussian Splatting

Yan, Xiaoyang, Pei, Muleilan, Shen, Shaojie

arXiv.org Artificial Intelligence

3D occupancy prediction is critical for comprehensive scene understanding in vision-centric autonomous driving. Recent advances have explored utilizing 3D semantic Gaussians to model occupancy while reducing computational overhead, but they remain constrained by insufficient multi-view spatial interaction and limited multi-frame temporal consistency. To overcome these issues, in this paper, we propose a novel Spatial-Temporal Gaussian Splatting (ST-GS) framework to enhance both spatial and temporal modeling in existing Gaussian-based pipelines. Specifically, we develop a guidance-informed spatial aggregation strategy within a dual-mode attention mechanism to strengthen spatial interaction in Gaussian representations. Furthermore, we introduce a geometry-aware temporal fusion scheme that effectively leverages historical context to improve temporal continuity in scene completion. Extensive experiments on the large-scale nuScenes occupancy prediction benchmark showcase that our proposed approach not only achieves state-of-the-art performance but also delivers markedly better temporal consistency compared to existing Gaussian-based methods.


MSRFormer: Road Network Representation Learning using Multi-scale Feature Fusion of Heterogeneous Spatial Interactions

Yang, Jian, Wu, Jiahui, Fang, Li, Fan, Hongchao, Zhang, Bianying, Zhao, Huijie, Yang, Guangyi, Xin, Rui, You, Xiong

arXiv.org Artificial Intelligence

Transforming road network data into vector representations using deep learning has proven effective for road network analysis. However, the heterogeneous and hierarchical nature of urban road networks poses challenges for accurate representation learning. Graph neural networks, which aggregate features from neighboring nodes, often struggle due to their homogeneity assumption and focus on a single structural scale. To address these issues, this paper presents MSRFormer, a novel road network representation learning framework that integrates multi-scale spatial interactions by addressing their flow heterogeneity and long-distance dependencies. It uses spatial flow convolution to extract small-scale features from large trajectory datasets, and identifies scale-dependent spatial interaction regions to capture the spatial structure of road networks and flow heterogeneity. By employing a graph transformer, MSRFormer effectively captures complex spatial dependencies across multiple scales. The spatial interaction features are fused using residual connections and fed to a contrastive learning algorithm to derive the final road network representation. Validation on two real-world datasets demonstrates that MSRFormer outperforms baseline methods in two road network analysis tasks. The performance gains suggest that traffic-related tasks benefit more from incorporating trajectory data, with improvements of up to 16% over the most competitive baseline on complex road network structures. This research provides a practical framework for developing task-agnostic road network representation models and highlights distinct association patterns in the interplay between scale effects and flow heterogeneity of spatial interactions.
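The "spatial flow convolution" idea, message passing weighted by observed trajectory flows rather than plain adjacency, can be sketched as follows. This is an illustrative simplification under assumed names and shapes, not MSRFormer's actual layer: real implementations use learned weight matrices and nonlinearities.

```python
def flow_convolution(features, flows):
    """One spatial-flow convolution step: each road segment aggregates
    neighbour features weighted by trajectory flow volume, so heterogeneous
    interaction strengths enter the message passing directly.

    `features[i]` is the feature vector of segment i; `flows[i][j]` is the
    trajectory flow from segment i to segment j (0 if unconnected).
    """
    n = len(features)
    dim = len(features[0])
    out = []
    for i in range(n):
        total = sum(flows[i]) or 1.0  # avoid division by zero for sinks
        agg = [0.0] * dim
        for j in range(n):
            w = flows[i][j] / total
            for k in range(dim):
                agg[k] += w * features[j][k]
        # residual connection, mirroring the fusion step described above
        out.append([f + a for f, a in zip(features[i], agg)])
    return out

out = flow_convolution([[1.0, 0.0], [0.0, 1.0]], [[0.0, 2.0], [0.0, 0.0]])
```

A segment with heavy flow toward a neighbour absorbs more of that neighbour's features; an isolated segment keeps its own representation unchanged.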



Feasible Action Space Reduction for Quantifying Causal Responsibility in Continuous Spatial Interactions

George, Ashwin, Siebert, Luciano Cavalcante, Abbink, David A., Zgonnikov, Arkady

arXiv.org Artificial Intelligence

Understanding the causal influence of one agent on another is crucial for safely deploying artificially intelligent systems such as automated vehicles and mobile robots in human-inhabited environments. Existing models of causal responsibility deal with simplified abstractions of scenarios with discrete actions, thus limiting real-world use when understanding responsibility in spatial interactions. Based on the assumption that spatially interacting agents are embedded in a scene and must follow an action at each instant, Feasible Action-Space Reduction (FeAR) was proposed as a metric for causal responsibility in a grid-world setting with discrete actions. Since real-world interactions involve continuous action spaces, this paper proposes a formulation of the FeAR metric for measuring causal responsibility in space-continuous interactions. We illustrate the utility of the metric in prototypical space-sharing conflicts, and showcase its applications in analysing backward-looking responsibility and in estimating forward-looking responsibility to guide agent decision making. Our results highlight the potential of the FeAR metric for designing and engineering artificial agents, as well as for assessing the responsibility of agents around humans.
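The intuition behind a feasible-action-space reduction metric can be made concrete for a one-dimensional continuous action space. This is a deliberately simplified sketch, not the paper's exact formulation: the interval representation and the ratio form are assumptions for illustration.

```python
def fear_ratio(feasible_alone, feasible_with_other):
    """FeAR-style ratio for a 1-D continuous action space: the fraction of
    agent A's feasible action measure that agent B's presence removes.

    Each argument is a list of (lo, hi) intervals of feasible actions
    (e.g. accelerations). Returns 0 when A's space was empty to begin with.
    """
    def measure(intervals):
        return sum(hi - lo for lo, hi in intervals)

    m_alone = measure(feasible_alone)
    m_with = measure(feasible_with_other)
    return (m_alone - m_with) / m_alone if m_alone > 0 else 0.0

# Agent B blocks half of A's feasible accelerations:
r = fear_ratio([(0.0, 4.0)], [(0.0, 2.0)])
```

A ratio of 0 means B does not constrain A at all; a ratio approaching 1 means B leaves A almost no feasible action, suggesting high causal responsibility for the resulting behaviour.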


GeoAI-Enhanced Community Detection on Spatial Networks with Graph Deep Learning

Liang, Yunlei, Zhu, Jiawei, Ye, Wen, Gao, Song

arXiv.org Artificial Intelligence

Spatial networks are useful for modeling geographic phenomena where spatial interaction plays an important role. To analyze spatial networks and their internal structures, graph-based methods such as community detection have been widely used. Community detection aims to extract strongly connected components from the network and reveal hidden relationships between nodes, but such methods usually do not involve attribute information. To consider edge-based interactions and node attributes together, this study proposes a family of GeoAI-enhanced unsupervised community detection methods called region2vec, based on Graph Attention Networks (GAT) and Graph Convolutional Networks (GCN). The region2vec methods generate node neural embeddings based on attribute similarity, geographic adjacency, and spatial interactions, and then extract network communities from the node embeddings using agglomerative clustering. The proposed GeoAI-based methods are compared with multiple baselines and perform best when one wants to simultaneously maximize node attribute similarity and spatial interaction intensity within the spatial network communities. The method is further applied to the shortage-area delineation problem in public health and demonstrates its promise for regionalization problems.
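The two-stage pipeline, embed nodes using attributes plus interaction-weighted neighbours, then agglomeratively cluster the embeddings, can be sketched in a few lines. This is a hedged toy version: the single averaging step stands in for the GAT/GCN encoder, and the single-linkage merge loop stands in for a production clustering routine; all names here are assumptions.

```python
import math

def embed(attrs, weights):
    """One propagation step in the spirit of region2vec: each node's
    embedding mixes its own attributes with neighbours' attributes,
    weighted by spatial-interaction strength `weights[i][j]`."""
    n, dim = len(attrs), len(attrs[0])
    out = []
    for i in range(n):
        z = sum(weights[i]) + 1.0  # self-weight of 1
        emb = [a / z for a in attrs[i]]
        for j in range(n):
            for k in range(dim):
                emb[k] += weights[i][j] * attrs[j][k] / z
        out.append(emb)
    return out

def agglomerate(emb, k):
    """Plain single-linkage agglomerative clustering on the embeddings:
    repeatedly merge the closest pair of clusters until k remain."""
    clusters = [[i] for i in range(len(emb))]

    def dist(a, b):
        return min(math.dist(emb[i], emb[j]) for i in a for j in b)

    while len(clusters) > k:
        i, j = min(
            ((i, j) for i in range(len(clusters)) for j in range(i + 1, len(clusters))),
            key=lambda p: dist(clusters[p[0]], clusters[p[1]]),
        )
        clusters[i] += clusters.pop(j)
    return clusters

attrs = [[0.0], [0.1], [5.0], [5.1]]
weights = [[0.0] * 4 for _ in range(4)]  # no interaction: pure attribute clustering
communities = agglomerate(embed(attrs, weights), 2)
```

With nonzero interaction weights, strongly interacting nodes are pulled toward each other in embedding space and so tend to land in the same community, which is the balance the study evaluates.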


Graph Fourier Neural ODEs: Bridging Spatial and Temporal Multiscales in Molecular Dynamics

Sun, Fang, Huang, Zijie, Wang, Haixin, Cao, Yadi, Luo, Xiao, Wang, Wei, Sun, Yizhou

arXiv.org Artificial Intelligence

Molecular dynamics simulations are crucial for understanding complex physical, chemical, and biological processes at the atomic level. However, accurately capturing interactions across multiple spatial and temporal scales remains a significant challenge. We present a novel framework that jointly models spatial and temporal multiscale interactions in molecular dynamics. Our approach leverages Graph Fourier Transforms to decompose molecular structures into different spatial scales and employs Neural Ordinary Differential Equations to model the temporal dynamics in a curated manner influenced by the spatial modes. We evaluate our model on the MD17 dataset, demonstrating consistent performance improvements over state-of-the-art baselines across multiple molecules, particularly under challenging conditions such as irregular timestep sampling and long-term prediction horizons. Ablation studies confirm the significant contributions of both spatial and temporal multiscale modeling components.
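The spatial-scale decomposition the abstract relies on is the Graph Fourier Transform: projecting a node signal onto the eigenvectors of the graph Laplacian, where small eigenvalues correspond to smooth, large-scale modes and large eigenvalues to fine-scale ones. A minimal sketch (function names are assumptions; the paper couples these modes to a Neural ODE, which is omitted here):

```python
import numpy as np

def graph_fourier(adj, signal):
    """Graph Fourier Transform of a node signal: project onto the
    eigenvectors of the combinatorial Laplacian L = D - A. Low-frequency
    modes (small eigenvalues) capture large spatial scales, high-frequency
    modes capture fine ones."""
    lap = np.diag(adj.sum(axis=1)) - adj
    eigvals, eigvecs = np.linalg.eigh(lap)  # symmetric L -> real spectrum
    coeffs = eigvecs.T @ signal             # forward GFT
    return eigvals, eigvecs, coeffs

def inverse_graph_fourier(eigvecs, coeffs):
    """Exact reconstruction: the eigenvector basis is orthonormal."""
    return eigvecs @ coeffs

# Path graph on 4 atoms (a toy stand-in for a molecular graph)
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [0, 0, 1, 0]], dtype=float)
x = np.array([1.0, 2.0, 3.0, 4.0])
vals, vecs, coeffs = graph_fourier(adj, x)
recon = inverse_graph_fourier(vecs, coeffs)
```

Truncating `coeffs` to the lowest-frequency entries before reconstruction yields a coarse-scale version of the signal, which is the per-scale representation such a framework can evolve in time.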