AITopics | Country

Collaborating Authors

Country

Localizing Knowledge in Diffusion Transformers

Neural Information Processing SystemsJun-15-2026, 01:57:08 GMT

Understanding how knowledge is distributed across the layers of generative models is crucial for improving interpretability, controllability, and adaptation. While prior work has explored knowledge localization in UNet-based architectures, Diffusion Transformer (DiT)-based models remain underexplored in this context. In this paper, we propose a model-and knowledge-agnostic method to localize where specific types of knowledge are encoded within the DiT blocks. We evaluate our method on state-of-the-art DiT-based models, including PixArt-α, FLUX, and SANA, across six diverse knowledge categories. We show that the identified blocks are both interpretable and causally linked to the expression of knowledge in generated outputs. Building on these insights, we apply our localization framework to two key applications: model personalization and knowledge unlearning. In both settings, our localized fine-tuning approach enables efficient and targeted updates, reducing computational cost, improving task-specific performance, and better preserving general model behavior with minimal interference to unrelated or surrounding content. Overall, our findings offer new insights into the internal structure of DiTs and introduce a practical pathway for more interpretable, efficient, and controllable model editing. 1

large language model, machine learning, natural language, (22 more...)

Neural Information Processing Systems

Country:

Europe (0.67)
North America > United States (0.46)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Media (0.68)
Leisure & Entertainment (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
(2 more...)

Add feedback

UniHG: ALarge-scale Universal Heterogeneous Graph Dataset and Benchmark for Representation Learning and Cross-Domain Transferring

Neural Information Processing SystemsJun-15-2026, 01:56:07 GMT

Irregular data in the real world are usually organized as heterogeneous graphs consisting of multiple types of nodes and edges. However, current heterogeneous graph research confronts three fundamental challenges: i) Benchmark Deficiency, ii) Semantic Disalignment, and iii) Propagation Degradation. In this paper, we construct a large-scale, universal, and joint multi-domain heterogeneous graph dataset named UniHG to facilitate heterogeneous graph representation learning and cross-domain knowledge mining. Overall, UniHG contains 77.31 million nodes and 564 million directed edges with thousands of labels and attributes, which is currently the largest universal heterogeneous graph dataset available to the best of our knowledge. To perform effective learning and provide comprehensively benchmarks on UniHG, two key measures are taken, including i) the semantic alignment strategy for multi-attribute entities, which projects the feature description of multi-attribute nodes and edges into a common embedding space to facilitate information aggregation; ii) proposing the novel Heterogeneous Graph Decoupling (HGD) framework with a specifically designed Anisotropy Feature Propagation (AFP) module for learning effective multi-hop anisotropic propagation kernels. These two strategies enable efficient information propagation among a tremendous number of multi-attribute entities and meanwhile mine multi-attribute association adaptively through the multi-hop aggregation in large-scale heterogeneous graphs. Comprehensive benchmark results demonstrate that our model significantly outperforms existing methods with an accuracy improvement of 28.93%. And the UniHG can facilitate downstream tasks, achieving an NDCG@20 improvement rate of 11.48% and 11.71%.

data mining, knowledge management, machine learning, (22 more...)

Neural Information Processing Systems

Country:

Europe (0.28)
Asia > China (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Media (1.00)
Information Technology (1.00)
Health & Medicine (1.00)
Leisure & Entertainment (0.67)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(5 more...)

Add feedback

Estimating cognitive biases with attention-aware inverse planning

Neural Information Processing SystemsJun-15-2026, 01:42:28 GMT

People's goal-directed behaviors are influenced by their cognitive biases, and autonomous systems that interact with people should be aware of this. For example, people's attention to objects in their environment will be biased in a way that systematically affects how they perform everyday tasks such as driving to work. Here, building on recent work in computational cognitive science, we formally articulate the attention-aware inverse planning problem, in which the goal is to estimate a person's attentional biases from their actions. We demonstrate how attention-aware inverse planning systematically differs from standard inverse reinforcement learning and how cognitive biases can be inferred from behavior. Finally, we present an approach to attention-aware inverse planning that combines deep reinforcement learning with computational cognitive modeling. We use this approach to infer the attentional strategies of RL agents in real-life driving scenarios selected from the Waymo Open Dataset, demonstrating the scalability of estimating cognitive biases with attention-aware inverse planning.

machine learning, reinforcement learning, simulation of human behavior, (19 more...)

Neural Information Processing Systems

Country: North America > United States > California (0.46)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Industry:

Information Technology (0.93)
Automobiles & Trucks (0.92)
Transportation > Ground > Road (0.67)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Simulation of Human Behavior (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

1543d6d5cb976e4f9fbfaedf2e257967-Supplemental-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsJun-15-2026, 01:41:15 GMT

LCDB 1.1: ADatabase Illustrating Learning Curves Are More Ill-Behaved Than Previously Thought For the actual appendices, please see the main paper submission. Here, we would like to make a few2 notes regarding the dataset hosting.3 Self-Hosting Platform Our dataset is self-hosted on the 4TU.ResearchData platform, a trusted4 institutional repository based in the Netherlands, which guarantees long-term preservation of research5 data for a minimum of 15 years.16 Data Access Note We provide a public access link (also attached in the main submission).27 Machine Access via Croissant Metadata For machine access, Croissant metadata file can be8 found in our GitHub repository.39

artificial intelligence, repository, submission, (2 more...)

Neural Information Processing Systems

Country: Europe > Netherlands (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

1543d6d5cb976e4f9fbfaedf2e257967-Paper-Datasets_and_Benchmarks_Track.pdf

Neural Information Processing SystemsJun-15-2026, 01:41:12 GMT

Sample-wise learning curves plot performance versus training set size. They are useful for studying scaling laws and speeding up hyperparameter tuning and model selection. Learning curves are often assumed to be well-behaved: monotone (i.e.

artificial intelligence, lcdb 1, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe (0.46)
North America (0.28)
Asia (0.27)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.92)
Information Technology (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.92)
(2 more...)

Add feedback

Starmer to confirm social media ban for U.K. teens ahead of G7 meet

The Japan TimesJun-15-2026, 01:33:00 GMT

Starmer to confirm social media ban for U.K. teens ahead of G7 meet U.K. Prime Minister Keir Starmer is expected to confirm a social media ban on children under 16 on Monday morning. U.K. Prime Minister Keir Starmer will start a crucial week for his premiership by announcing a package of strong restrictions designed to protect British teenagers from online threats. Starmer is expected Monday morning to confirm a ban on children under 16 using major social media platforms, as well as other measures including curfews on older teenagers and tough regulations on chatbots. He will then depart for a Group of Seven summit at Evian-les-Bains, France, where he faces awkward questions following last week's resignation of his defense secretary and uncertainty around the U.K.'s military budget. A ban on young teenagers using social media is popular with the U.K. public despite concerns around how effectively it can be enforced. The Labour government's new range of restrictions -- including some against chatbots and online games -- will go further than laws in Australia, according to a person familiar with the situation, where a ban on social media for teens came into effect last year.

artificial intelligence, chatbot, natural language, (11 more...)

The Japan Times

Country:

Europe (1.00)
Asia > Japan (0.65)
Asia > Middle East > Iran (0.41)

Industry: Government > Regional Government > Europe Government > United Kingdom Government (0.36)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.47)

Add feedback

CHiQPM: Calibrated Hierarchical Interpretable Image Classification

Neural Information Processing SystemsJun-15-2026, 01:28:11 GMT

Globally interpretable models are a promising approach for trustworthy AI in safetycritical domains. Alongside global explanations, detailed local explanations are a crucial complement to effectively support human experts during inference. This work proposes the Calibrated Hierarchical QPM (CHiQPM) which offers uniquely comprehensive global and local interpretability, paving the way for human-AI complementarity. CHiQPM achieves superior global interpretability by contrastively explaining the majority of classes and offers novel hierarchical explanations that are more similar to how humans reason and can be traversed to offer a built-in interpretable Conformal prediction (CP) method. Our comprehensive evaluation shows that CHiQPM achieves state-of-the-art accuracy as a point predictor, maintaining 99% accuracy of non-interpretable models. This demonstrates a substantial improvement, where interpretability is incorporated without sacrificing overall accuracy. Furthermore, its calibrated set prediction is competitively efficient to other CP methods, while providing interpretable predictions of coherent sets along its hierarchical explanation.

artificial intelligence, chiqpm, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > Germany (0.28)
Asia > Middle East (0.27)

Genre: Research Report > Experimental Study (1.00)

Industry:

Education (0.67)
Government (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Place Cells as Multi-Scale Position Embeddings: Random Walk Transition Kernels for Path Planning

Neural Information Processing SystemsJun-15-2026, 01:27:30 GMT

The hippocampus supports spatial navigation by encoding cognitive maps through collective place cell activity. We model the place cell population as non-negative spatial embeddings derived from the spectral decomposition of multi-step random walk transition kernels.

artificial intelligence, machine learning, navigation, (20 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.28)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.89)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
(3 more...)

Add feedback

Adaptive Gradient Masking for Balancing ID and based Representations in Recommendation

Neural Information Processing SystemsJun-15-2026, 01:26:58 GMT

In large-scale recommendation systems, multimodal (MM) content is increasingly introduced to enhance the generalization of ID features. The rise of Multimodal Large Language Models (MLLMs) enables the construction of unified user and item representations. However, the semantic distribution gap between MM and ID representations leads to convergence inconsistency during joint training: the ID branch converges quickly, while the MM branch requires more epochs, thus limiting overall performance. To address this, we propose a two-stage framework including MM representation learning and joint training optimization. First, we fine-tune the MLLM to generate unified user and item representations, and introduce collaborative signals by post-aligning user ID representations to alleviate semantic differences. Then, we propose an Adaptive Gradient Masking (AGM) training strategy to dynamically regulate parameter updates between ID and MLLM branches. AGM estimates the contribution of each representation with mutual information, and applies non-uniform gradient masking at the sub-network level to balance optimization. We provide theoretical analysis of AGM's effectiveness and further introduce an unbiased variant, AGM*, to enhance training stability.

large language model, machine learning, natural language, (17 more...)

Neural Information Processing Systems

Country: North America > United States (0.67)

Genre: