AITopics | spatial information

State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding

Neural Information Processing SystemsJun-22-2026, 21:06:46 GMT

Recently, pre-trained state space models have shown great potential for video classification, which sequentially compresses visual tokens in videos with linear complexity, thereby improving the processing efficiency of video data while maintaining high performance. To apply powerful pre-trained models to downstream tasks, prompt learning is proposed to achieve efficient downstream task adaptation with only a small number of fine-tuned parameters. However, the sequentially compressed visual prompt tokens fail to capture the spatial and temporal contextual information in the video, thus limiting the effective propagation of spatial information within a video frame and temporal information between frames in the state compression model and the extraction of discriminative information. To tackle the above issue, we proposed a State Space Prompting (SSP) method for video understanding, which combines intra-frame and inter-frame prompts to aggregate and propagate key spatiotemporal information in the video. Specifically, an Intra-Frame Gathering (IFG) module is designed to aggregate spatial key information within each frame. Besides, an Inter-Frame Spreading (IFS) module is designed to spread discriminative spatio-temporal information across different frames.

artificial intelligence, information, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > China (0.28)
North America (0.28)

Genre: Research Report > Experimental Study (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision > Video Understanding (0.91)

Add feedback

Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism

Neural Information Processing SystemsJun-17-2026, 04:53:06 GMT

Multi-agent collaboration enhances the perception capabilities of individual agents through information sharing. However, in real-world applications, differences in sensors and models across heterogeneous agents inevitably lead to domain gaps during collaboration. Existing approaches based on adaptation and reconstruction fail to support pragmatic heterogeneous collaboration due to two key limitations: (1) Intrusive retraining of the encoder or core modules disrupts the established semantic consistency among agents; and (2) accommodating new agents incurs high computational costs, limiting scalability. To address these challenges, we present a novel Generative Communication mechanism (GenComm) that facilitates seamless perception across heterogeneous multi-agent systems through feature generation, without altering the original network, and employs lightweight numerical alignment of spatial information to efficiently integrate new agents at minimal cost. Specifically, a tailored Deformable Message Extractor is designed to extract spatial message for each collaborator, which is then transmitted in place of intermediate features. The Spatial-Aware Feature Generator, utilizing a conditional diffusion model, generates features aligned with the ego agent's semantic space while preserving the spatial information of the collaborators. These generated features are further refined by a Channel Enhancer before fusion. Experiments conducted on the OPV2V-H, DAIR-V2X and V2X-Real datasets demonstrate that GenComm outperforms existing state-of-the-art methods, achieving an 81% reduction in both computational cost and parameter count when incorporating new agents.

agent, artificial intelligence, information, (18 more...)

Neural Information Processing Systems

Genre:

Research Report > Experimental Study (1.00)
Research Report > Promising Solution (0.66)

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)

Add feedback

Estimating Nonlinear Neural Response Functions using GP Priors and Kronecker Methods

Cristina Savin, Gasper Tkacik

Neural Information Processing SystemsMay-1-2026, 05:36:35 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, kernel, machine learning, (19 more...)

Neural Information Processing Systems

Country: Europe (0.68)

Genre: Research Report (0.68)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

1da546f25222c1ee710cf7e2f7a3ff0c-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 00:11:28 GMT

Add feedback

151de84cca69258b17375e2f44239191-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 20:16:19 GMT

artificial intelligence, garment, machine learning, (15 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Sensing and Signal Processing > Image Processing (0.68)

Add feedback

12e35d9186dd72fe62fd039385890b9c-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 18:50:54 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.46)

Industry:

Health & Medicine > Therapeutic Area > Neurology (0.93)
Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.70)
Information Technology > Communications > Networks (0.70)

Add feedback

10c272d06794d3e5785d5e7c5356e9ff-Paper.pdf

Neural Information Processing SystemsApr-24-2026, 18:12:38 GMT

artificial intelligence, information, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > China (0.28)

Industry:

Health & Medicine > Diagnostic Medicine (0.70)
Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (0.68)
Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.69)

Add feedback

RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation

Neural Information Processing SystemsMar-22-2026, 11:35:25 GMT

However, traditional approaches frequently encounter issues like over-segmentation or mis-segmentation, due to insufficient emphasis on spatial information of instances. In this paper, we introduce a Rule-Guided Spatial Awareness Network (RG-SAN) by utilizing solely the spatial information of the target instance for supervision. This approach enables the network to accurately depict the spatial relationships among all entities described in the text, thus enhancing the reasoning capabilities. The RG-SAN consists of the Text-driven Localization Module (TLM) and the Rule-guided Weak Supervision (RWS) strategy. The TLM initially locates all mentioned instances and iteratively refines their positional information. The RWS strategy, acknowledging that only target objects have supervised positional information, employs dependency tree rules to precisely guide the core instance's positioning. Extensive testing on the ScanRefer benchmark has shown that RG-SAN not only establishes new performance benchmarks, with an mIoU increase of 5.1 points, but also exhibits significant improvements in robustness when processing descriptions with spatial ambiguity. All codes are available at https://github.com/sosppxo/RG-SAN.

artificial intelligence, information, proceedings, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.82)

Add feedback

80d46bb66ea003f4b29fa6013905d50a-Paper-Conference.pdf

Neural Information Processing SystemsFeb-19-2026, 06:53:25 GMT

dependency, information, representation, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Beijing > Beijing (0.05)
Asia > China > Guangdong Province > Guangzhou (0.04)
North America > United States > California > Los Angeles County (0.04)
(2 more...)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
(3 more...)

Add feedback

RG-SAN: Rule-GuidedSpatialAwarenessNetworkfor End-to-End3DReferringExpressionSegmentation

Neural Information Processing SystemsFeb-18-2026, 03:02:14 GMT

TGNN[24]introduce3D-RESby extending the bounding box annotations of ScanRefer [5] to masks by incorporating the instance masks from ScanNet and proposed a two-stage pipeline. Further, 3D-STMN [65] proposed an end-to-end method that matches the text and superpoints to get the 3D segmentation of the target object directly.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)
(3 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Filters

Collaborating Authors

spatial information

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

State Space Prompting via Gathering and Spreading Spatio-Temporal Information for Video Understanding

Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism

Estimating Nonlinear Neural Response Functions using GP Priors and Kronecker Methods

1da546f25222c1ee710cf7e2f7a3ff0c-Paper.pdf

151de84cca69258b17375e2f44239191-Paper.pdf

12e35d9186dd72fe62fd039385890b9c-Paper.pdf

10c272d06794d3e5785d5e7c5356e9ff-Paper.pdf

RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation

80d46bb66ea003f4b29fa6013905d50a-Paper-Conference.pdf

RG-SAN: Rule-GuidedSpatialAwarenessNetworkfor End-to-End3DReferringExpressionSegmentation